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__Mianual Update 


Document Title: Second-Generation TMS320 Users Guide 


Document Number: SPRU014 ECN Number: 526628 


The following are changes to the Second-Generation TMS320 User’s Guide. These changes will 
be incorporated in the next revision of the manual. Bars in the right margin indicate changes or 
additions to the manual. 


Page Change or Add 
3-25 On the top line following the words “which do not affect the accumulator” add a 


footnote flag (t)and, on the bottom of the page, add the following footnote: 


TtBIT instruction may affect the accumulator on the TMS32020 under certain Al 
circumstances. Refer to Section 4.3, Page 4-45. 


4-45 Under the Execution heading, following the words “Affects TC”, add the following 
warning: . 


Caution: See note on next page concerning execution by TMS32020. 


Sis TEXAS 
INSTRUMENTS 


SPRZ047 


Page Change or Add 
4-46 Under the example given for the Test Bit instruction add the following note: 
Note: 
This instruction may affect the contents of the accumulator on the TMS32020 
if the following conditions occur: 
1) Overflow mode set (OVM status register bit is set). 
2) Two LSBs of BIT instruction opcode (bits 8 and 9 of the instruction word) 
are zero. 
3) Addition of accumulator contents with contents of addressed memory 
would cause accumulator overflow. 
If all of the above conditions are met, the contents of the accumulator will be 
replaced by the positive or negative saturation value, depending on the polarity 
of the overflow. 
This situation can be avoided by any one of the following means: 
Precede BIT instruction with ROVM, and follow BIT with SOVM instruction. 
If direct addressing is being used, reorganize memory so that page relative 
locations 0, 4, 8, C, and 10 are not used. 
lf indirect addressing is used, select new ARP that is not ARO or AR4, and 
restore code later (if necessary) with LARP ARO/4. 
Use BITT instead of BIT. BITT does not affect accumulator under any 
circumstances. 
@ Use TMS320C25 (pin and object code compatible) instead. 
This situation occurs only when the BIT instruction is executed by a TMS32020 
that is in the saturation mode. 
Page Change or Add 
5-32 Following the last paragraph of section 5.5.2 (under the words “past processing”) 


add the following note: 


Note: 


Under certain circumstances, executing the BIT instruction may affect the 


contents of the accumulator on the TMS32020 device. For more information, refer 
to the instruction definition in Section 4.3 Page 4-45. 
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IMPORTANT NOTICE 


Texas Instruments (TI) reserves the right to make changes to or to 
discontinue any semiconductor product or service identified in this 
publication without notice. Tl advises its customers to obtain the latest 
version of the relevant information to verify, before placing orders, 
that the information being relied upon is current. 


Tl. warrants performance of its semiconductor products to current 
specifications in accordance with TI’s standard warranty. Testing and 
other quality control techniques are utilized to the extent Tl deems 
necessary to support this warranty. Uniess mandated by government 
requirements, specific testing of all parameters of each device is not 
necessarily performed. 


Tl assumes no liability for T! applications assistance, customer product 
design, software performance, or infringement of patents or services 
described herein. Nor does TI warrant or represent that any license, 
either express or implied, is granted under any patent right, copyright, 
mask work right, or other intellectual property right of T! covering or 
relating to any combination, machine, or process in which such 
semiconductor products or services might be or are used. 


Copyright © 1987, Texas Instruments Incorporated 
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1. 


Introduction 


The TMS320 family of 16/32-bit single-chip digital signal processors com- 
bines the flexibility of a high-speed controller with the numerical capability of 
an array processor, offering an inexpensive alternative to custom VLSI and 
multichip bit-slice processors for signal processing. 


The TMS32010, the first digital signal processor in the TMS320 family, was 
introduced in 1983. Since that time, the TMS320 family has established itself 
as the industry standard for digital signal processing. The powerful instruction 
set, inherent flexibility, high-speed number-crunching capabilities, and inno- 
vative architecture have made the high-performance, cost-effective processors 
in the TMS320 family the ideal solution to many telecommunications, com- 
puter, commercial, industrial, and military applications. 


The TMS320 family has now expanded into three generations of processors: 
TMS320C1 x, TMS320C2x, and TMS320C3x (see Figure 1-1). Many features 
are common among these generations. Some specific features are added in 
each processor to provide different cost/performance tradeoffs. Software 
compatibility is maintained throughout the family to. protect the user’s invest- 
ment in architecture. Each processor has software and hardware tools to 
facilitate rapid design. 
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Figure 1-1. TMS320 Device Evolution 
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This document discusses the second-generation devices (TMS320C2x) 
within the TMS320 family. The specific members of the second-generation 
TMS320 include: 


e TMS32020, an NMOS 20-MHz digital signal processor capable of twice 
the performance of the TMS320C1x devices, and 


@ §TMS320C25, a CMOS 40-MHz version of the TMS32020 with twice 
the performance of the TMS32020. 


Plans for expansion of the TMS320 family include more spinoffs of the exist- 
ing generations as well as more powerful future generations of digital signal 
processors. 


The TMS320 family combines the high performance and specialized features 
necessary in digital signal processing (DSP) applications with an extensive 
program of development support, including hardware and software develop- 
ment tools, product documentation, textbooks, newsletters, DSP design 
workshops, and a variety of application reports. See Appendix D for a dis- 
cussion of the wide range of development tools available. 
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1.1 General Description 


The combination of the TMS320’s Harvard-type architecture (separate pro- 
gram and data buses) and its special digital signal processing (DSP) instruc- 
tion set provide speed and flexibility to produce a microprocessor family 
capable of executing 10 MIPS (million instructions per second). The TMS320 
family optimizes speed by implementing functions in hardware that other 
processors implement through software or microcode. This hardware-inten- 
sive approach provides the design engineer with power previously unavailable 
on a single chip. 


The second generation of the TMS320 family includes two members, the 
TMS32020 and the TMS320C25. The architecture of these devices is based 
upon that of the TMS32010. Table 1-1 provides an overview of the 
TMS320C2x group of processors with comparisons of technology, memory, 
1/0, cycle timing, and package type. 


Table 1-1. TMS320C2x Processors Overview 


MEMORY ot CYCLE| PACKAGE 
DEVICE | TECH | ON-CHIP OFF-CHIP TIME | TYPE 
RAM ROM PROG DATA|SER PAR DMA] (ns) |PGA PLCC 
TMS32020¢ | NMOS | 544 - 64K 64K | YES 16x16 YES | 200 | 68 - | 
TMS320C258| CMOS | 544 4K 64K 64K | YES 16x16 CON {| 100 | 68 68 


TSER = serial; PAR = parallel; DMA = direct memory access; CON = concurrent DMA. 
tMilitary version available; contact nearest TI sales office for details. 
§Military version planned; contact nearest TI sales office for availability. 


The TM$S32020, processed in NMOS technology, is source-code upward 
compatible with the TMS32010 and in many applications is capable of two 
times the throughput of the TMS320C1x devices. It provides an enhanced 
instruction set (109 instructions), large on-chip data memory (544 words), 
large memory spaces, on-chip serial port, and a hardware timer. 


The TMS320C25, the newest member of the TMS320 second generation, is 
processed in CMOS technology. The TMS320C25 is capable of executing 
many instructions in a 100-ns cycle time. It is pin-for-pin and object-code 
upward compatible with the TMS32020. The TMS320C25’s enhanced fea- 
ture set greatly increases the functionality of the device over the TMS32020. 
Enhancements include 24 additional instructions (133 total), eight auxiliary 
registers, an eight-level hardware stack, 4K words of on-chip program ROM, 
a bit-reversed indexed-addressing mode, and the low-power dissipation in- 
herent to the CMOS process. 
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1.2 Key Features 
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Some of the key features of the TMS320C2x devices are listed below. Features 
specific to a particular device are noted by enclosing the device name in pa- 
rentheses. 


Instruction cycle timing: 
= 100 ns (TMS320C25) 
= 200 ns (TMS32020) 


544-word programmable on-chip data RAM 
4K-word on-chip program ROM (TMS320C25) 
128K-word total data/program memory space 
32-bit ALU/accumulator 

16 x 16-bit parallel multiplier with a 32-bit product 
Single-cycle multiply/accumulate instructions 


Repeat instructions for efficient use of program space and enhanced 
execution 


Block moves for data/program management 

On-chip timer for contro! operations 

Up to eight auxiliary registers with dedicated arithmetic unit 

Up to eight-level hardware stack 

Sixteen input and sixteen output channels 

16-bit parallel shifter 

Wait states for communication to slower off-chip memories/peripherals 
Serial port for direct codec interface 

Synchronization input for synchronous multiprocessor configurations 
Global data memory interface 

TMS320C1x source-code upward compatibility 

Concurrent DMA using an extended hold operation (TMS320C25) 


Instructions for adaptive filtering, FFTs, and extended-precision arith- 
metic (TMS320C25) 


Bit-reversed indexed-addressing mode for radix-2 FFTs (TMS320C25) 
On-chip clock generator 

Single 5-V supply 

Device packaging: 

= 68-pin PGA 

-  68-lead PLCC (TMS320C25) 


Technology: 
= NMOS (TMS32020) 
CMOS (TMS320C25) 


Commercial and military versions available. — 
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1.3 Typical Applications 


The TMS320 family’s unique versatility and realtime performance offer flexible 
design approaches in a variety of applications. In addition, TMS320 devices 
can simultaneously provide the multiple functions often required in those 
complex applications. Table 1-2 lists typical TMS320 family applications. 


Table 1-2. Typical Anvplications of the TMS320 Family 


GENERAL-PURPOSE DSP GRAPHICS/IMAGING INSTRUMENTATION 


Digital Filtering 3-D Rotation Spectrum Analysis 
Convolution Robot Vision Function Generation 
Correlation Image Transmission/ Pattern Matching 
Hilbert Transforms Compression Seismic Processing 
Fast Fourier Transforms Pattern Recognition Transient Analysis 
Adaptive Filtering Image Enhancement Digital Filtering 
Windowing Homomorphic Processing Phase-Locked Loops 
Waveform Generation Workstations 

Animation/Digital Map 


VOICE/SPEECH CONTROL MILITARY 


Voice Mail Disk Control Secure Communications 
Speech Vocoding Servo Control Radar Processing 

Speech Recognition Robot Control Sonar Processing 

Speaker Verification Laser Printer Control Image Processing 

Speech Enhancement Engine Control Navigation 

Speech Synthesis ‘Motor Control Missile Guidance 
Text-to-Speech Radio Frequency Modems 


TELECOMMUNICATIONS AUTOMOTIVE 


Echo Cancellation FAX Engine Control 
ADPCM Transcoders Cellular Telephones Vibration Analysis 
Digital PBXs Speaker Phones Antiskid Brakes 

Line Repeaters Digital Speech Adaptive Ride Control 
Channel! Multiplexing Interpolation (DSI) Global Positioning 
1200 to 19200-bps Modems — X.25 Packet Switching Navigation 

Adaptive Equalizers Video Conferencing Voice Commands 
DTMF Encoding/Decoding Spread Spectrum Digital Radio 

Data Encryption Communications Cellular Telephones 


CONSUMER INDUSTRIAL MEDICAL 


Radar Detectors Robotics Hearing Aids 

Power Tools Numeric Control Patient Monitoring 
Digital Audio/TV Security Access Ultrasound Equipment 
Music Synthesizer Power Line Monitors Diagnostic Tools 
Toys and Games Prosthetics 
Solid-State Answering Fetal Monitors 
Machines 


Many of the TMS320C2x features, such as single-cycle multiply/accumulate 
instructions, 32-bit arithmetic unit, large auxiliary register file with a separate 
arithmetic unit, and large on-chip RAM and ROM, make the device particularly 
applicable in digital signal processing systems. At the same time, general- 
purpose applications are greatly enhanced by the large address spaces, on- 
chip timer, serial port, multiple interrupt structure, provision for external wait 
states, and capability for multiprocessor interface and direct memory access. 
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The TMS320C2x provides the flexibility to be configured to satisfy a wide 
range of system requirements. This allows the device to be applied in systems 
currently using costly bit-slice processors or custom ICs. Some of the system 
configurations are: 


e A standalone system using on-chip memory, 
e Parallel multiprocessing systems with shared global data memory, or 
@ Host/peripheral coprocessing using interface control signals. 
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1.4 How To Use This Manual 


The purpose of this user’s guide is to serve as a reference book for the 
TMS320C2x digital signal processors. Sections 2 through 6 provide specific 
information about the architecture and operation of the device. Electrical 
specifications and mechanical data can be found in the data sheet (Appendix 


A). 


The following table lists each section and briefly describes the section con- 


tents. 


Section 2. 


Section 3. 


Section 4. 


Section 5. 


Section 6. 


Pinouts and Signal Descriptions. Drawings of the PGA and 
PLCC packages for TMS320C2x devices. Functional list- 
ings of the signals, their pin locations, and descriptions. 


Architecture. TMS320C2x design description, hardware 
components, and device operation. Functional block dia- 
gram and internal hardware summary table. 


Assembly Language Instructions. Addressing modes and 
format descriptions. Instruction set summary listed ac- 
cording to function. Alphabetized individual instruction 
descriptions with examples. 


Software Applications. Software application examples for 
the use of various TMS320C2x instruction set features. 


Hardware Applications. Hardware design techniques and 
application examples for interfacing to memories, periph- 
erals, or other microcomputers/microprocessors. XDS de- 
sign considerations. System applications. 


Eight appendices are included to provide additional information. 


Appendix A. 


Appendix. B. 


Appendix C. 


Appendix D. 


Appendix E. 


Second-Generation TMS320 Data Sheet. Electrical spec- 
ifications, timing, and mechanical data for the TMS320C2x 
devices. 


SMJ32020 Data Sheet. Electrical specifications, timing, 
and mechanical data for this military devices. 


TMS320C2x System Migration. Information for upgrading 
a TMS320C1x to a TMS32020-based system and a 
TMS32020 to a TMS320C25-based system. 


TMS320C2x Instruction Cycle Timings. Listings of the 
number of cycles for an instruction to execute in a given 
memory configuration on the TMS32020 and_ the 
TMS320C25. 


Development Support/Part Order Information. Listings of 
the hardware and software available to support the 
TMS320C2x devices. 
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Appendix F. Memories, Analog Converters, Sockets, and Crystals. 
Listings of the Tl memories, analog conversion devices, 
and sockets available to support the TMS320C2x devices 
in DSP applications. Crystal specifications and vendors. 


Appendix G. ROM Codes. Discussion of ROM codes (mask options) 
and the procedure for implementation. 


Appendix H. Quality and Reliability. Discussion of Texas Instruments 
quality and reliability criteria for evaluating performance. 
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1.5 References 


The following reference list contains useful information regarding functions, 
operations, and applications of digital signal processing. These books also 
provide other references to many useful technical papers. The reference list is 
organized into categories of general DSP, speech, image processing, and di- 
gital control theory, and alphabetized by author. 


General Digital Signal Processing: 


Antoniou, Andreas, Digital Filters: Analysis and Design. New York, NY: 
McGraw-Hill Company, Inc., 1979. 


Brigham, E. Oran, The Fast Fourier Transform. Englewood Cliffs, NJ: 
Prentice-Hall, Inc., 1974. 


Burrus, C.S. and Parks, T.W., DFT/FFT and Convolution Algorithms. 
New York, NY: John Wiley and Sons, Inc., 1984. 


Digital Signal Processing Applications with the TMS320 Family, Texas 
Instruments, 1986; Prentice-Hall, Inc., 1987. 


Gold, Bernard and Rabiner, Lawrence R., Theory and Application of 
Digital Signal Processing. Englewood Cliffs, NJ: Prentice-Hall, Inc., 
1975. 


Gold, Bernard and Rader, C.M., Digital Processing of Signals. New 
York, NY: McGraw-Hill Company, Inc., 1969. 


Hamming, R.W., Digital Filters. Englewood Cliffs, NJ: Prentice-Hall, 
Inc., 1977. 


IEEE ASSP DSP Committee (Editor), Programs for Digital Signal Pro- 
cessing. New York, NY: IEEE Press, 1979. 


Jackson, Leland B., Digital Filters and Signal Processing. Hingham, MA: 
Kluwer Academic Publishers, 1986. 


Jones, D.L. and Parks, T.W., A Digital Signal Processing Laboratory 
Using the TMS32070. Englewood Cliffs, NJ: Prentice-Hall, Inc., 1987. 


Morris, L. Robert, Digital Signal Processing Software. Ottawa, Canada: 
Carieton University, 1983. 


Oppenheim, Alan V. (Editor), Applications of Digital Signal Processing. 
Englewood Cliffs, NJ: Prentice-Hall, Inc., 1978. 


Oppenheim, Alan V. and Schafer, R.W., Digital Signal Processing. En- 
glewood Cliffs, NJ: Prentice-Hall, Inc., 1975. 


Oppenheim, Alan V. and Willsky, A.N. with Young, I.T., Signals and 
Systems. Englewood Cliffs, NJ: Prentice-Hall, Inc., 1983. 


Parks, T.W. and Burrus, C.S., Digital Filter Design. New York, NY: John 
Wiley and Sons, Inc., 1987. 
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Speech: 


Gray, A.H. and Markel, J.D., Linear Prediction of Speech. New York, 
NY: Springer-Verlag, 1976. 


Jayant, N.S. and Noll, Peter, Digital Coding of Waveforms. Englewood 
Cliffs, NJ: Prentice-Hall, Inc., 1984. 


Papamichalis, Panos, Practical Approaches to Speech Coding. Engle- 
wood Cliffs, NJ: Prentice-Hall, Inc., 1987. 


Rabiner, Lawrence R. and Schafer, R.W., Digita/ Processing of Speech 
Signals. Englewood Cliffs, NJ: Prentice-Hall, Inc., 1978. 


Image Processing: 


Andrews, H.C. and Hunt, B.R., Digital Image Restoration. Englewood 
Cliffs, NJ: Prentice-Hall, Inc., 1977. 


Gonzales, Rafael C. and Wintz, Paul, Digital /mage Processing. Reading, 
MA: Addison-Wesley Publishing Company, Inc., 1977. 


Pratt, William K., Digital Image Processing. New York, NY: John Wiley 
and Sons, 1978. 


Digital Control Theory: 


Jacquot, R., Modern Digital Control! Systems. New York, NY: Marcel 
Dekker, Inc., 1981. 


Katz, P., Digital Control Using Microprocessors. Englewood Cliffs, NJ: 
Prentice-Hall, Inc., 1981. 


Kuo, B.C., Digital Control Systems. New York, NY: Holt, Reinholt and 
Winston, Inc., 1980. 


Moroney, P., /ssues in the Implementation of Digital Feedback Com- 
pensators. Cambridge, MA: The MIT Press, 1983. 


Phillips, C. and Nagle, H., Digita/ Contro/ System Analysis and Design. 
Englewood Cliffs, NJ: Prentice-Hall, Inc., 1984. 


2. Pinouts and Signal Descriptions 


The TMS320C2x (second-generation TMS320) digital signal processors are 
available in a 68-pin grid array (PGA) package. The TMS320C25 is alsa 
packaged in a 68-pin plastic-leaded chip carrier (PLCC). 


Adaptor sockets are commercially available to convert a TMS320C25 PLCC - 
package to a TMS32020-like 68-pin grid array (PGA) footprint, thus main- 
taining plug-in compatibility. 


When using the XDS emulator, refer to Section 6.1.3 for user target design 
considerations 


This section provides the pinouts and signal definitions in the following sub- 
sections: 


° TMS320C2x Pinouts (Section 2.1 on page 2-2) 
@ TMS320C2x Signal Descriptions (Section 2.2 on page 2-3) 


Electrical specifications and mechanical data are given in Appendix A, the 
Second-Generation TMS320 Data Sheet. 
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2.1 TMS320C2x Pinouts 


Figure 2-1 shows pinouts of the PGA packages for the TMS320C2x devices 
and the PLCC package for the TMS320C25. Refer to Section 6.1.3 for user 
target system design considerations when using the XDS. 
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2.2 TMS320C2x Signal Descriptions 


The signal descriptions for the TMS320C2x devices are provided in this sec- 
tion. Table 2-1 lists each signal, its pin location (PGA/PLCC), function, and 
operating mode(s), i.e., input, Output, or high-impedance state as indicated 
by 1, O, or Z. The signals in Table 2-1 are grouped according to function and 
alphabetized within that grouping. 
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Table 2-1. TMS320C2x Signal Descriptions 


SIGNAL PIN 1/0/zt DESCRIPTION 
= ee 


FAD DRESS/DATABUSES ss ss—<“‘i‘—‘—sisSCSCSY BUSES 


A15 MSB L10/43 Parallel address bus A15 (MSB) through AO (LSB). 
K9/42 Multiplexed to address external data/program memory or 
L9/41 1/O. Placed in high-impedance state in the hold mode. 
K8/40 
18/39 
K7/38 
L7/37 
K6/36 
K5/34 
L5/33 
K4/32 
L4/31 
K3/30 
L3/29 
K2/28 
K1/26 


D15 MSB Parallel data bus D15 (MSB) through DO (LSB). 

D14 Multiplexed to transfer data between the TMS320C2x and 
D13 external data/program memory or !/O devices. Placed ine 
D12 high-impedance state when not outputting or when RS or 
D11 HOLD is asserted. 

D1i0 

D9 

D8 


INTERFACE CONTROL SIGNALS 


K10/45 
J10/47 
J11/46 


Data, program, and !/O space select signals. Always high 
unless low level asserted for communicating to a 
particular external space. Placed in high-impedance 
state in the hold mode. 


B8/66 Data ready input. Indicates that an external device is pre- 
pared for the bus transaction to be completed. If the device 
is not ready (READY = Q), the TMS320C2x waits one cycle 
and checks READY again. READY also indicates a bus grant 
to an external device after a BR (bus request) signal. 
H11/48 Read/write signal. Indicates transfer direction when com- 

municating to an external device. Normally in read mode | 
= _ Le 


(high), unless low level asserted for performing a write op- 
eration. Placed in high-impedance state in the hold mode. 
T Input/Output/High-impedance state 


Strobe signal. Always high unless asserted low to indicate 
an external bus cycle. Placed in high-impedance state in the 
hold mode. 
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Table 2-1 TMS320C2x Signal Descriptions (Continued) 


SIGNAL PIN i/O/2t DESCRIPTION 
(PGA/PLCC) 


MULTIPROCESSING SIGNALS 
G11/50 Bus request signal. Asserted when the TMS320C2x requires 
access to an external global data memory space. READY is 
asserted to the device when the bus is available and the 
global data memory is available for the bus transaction. 
HOLD A7/67 Hold input. When asserted, the TMS320C2x places the data, 
address, and control lines in the high-impedance state. 
HOLDA E10/55 — Hold acknowledge signal. Indicates that the TMS320C2x 
has gone into the hold mode and that an external processor 
may access the !ocal external memory of the TMS320C2x. 
SYNC F2/19 eee Synchronization input. Allows clock synchronization of two 


or more TMS320C2x’s. SYNC is an active-low signal and 
INTERRUPT AND MISCELLANEOUS SIGNALS 


must be asserted on the rising edge of CLKIN. 
nil — eo 
- _ 


Branch control input. Polled by BIOZ instruction. If low, the 
TMS320C2x executes a branch. This signal must be active 
during the BIOZ instruction fetch. 


Interrupt acknowledge signal. Output is only valid while 
CLKOUT1 is low. Indicates receipt of an interrupt and that 
the program is branching to the interrupt-vector location 
indicated by A15-A0. 


INT2 External user interrupt inputs. Prioritized and maskable 
INT1 by the interrupt mask register and the interrupt mode bit. 
INTO 


Microprocessor/microcomputer mode select pin for the 
TMS320C25 only. When asserted low (microcomputer 
mode), the pin causes the internal ROM to be mapped into 
the lower 4K words of the program memory map. In the 
microprocessor mode, the lower 4K words of program me- 
mory are external. On the TMS32020, MP/MC must be 
connected to Vcc. 


MP/MC A6/1 


a x ¥ 
- — uy 
ae — | 


Tt Input/Output/High-impedance state 


Microstate complete signal. Asserted low and valid only 
during CLKOUT1 low when the TMS320C2x has just com- 
pleted a memory operation, such as an instruction fetch or 
a data memory read/write. MSC can be used to generate a 
one wait-state READY signal for slow memory. 


Reset input. Causes the TMS320C2x to terminate execution 
and forces the program counter to zero. When brought to a 
high level, execution begins at location zero of program 
memory. RS affects various registers and status bits. 


External flag output (latched software-programmable sig- 
nal). Used for signalling other processors in multiprocessor 
configurations or as a general-purpose output pin. 
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Table 2-1. TMS320C2x Signal Descriptions (Concluded) 


SIGNAL PIN 1/0/zt DESCRIPTION 
(PGA/PLCC) 


SUPPLY/OSCILLATOR SIGNALS 


Master clock output signal (CLKIN frequency/4). In this 
document (and on the TMS320C25), CLKOUT11 rises at the 
beginning of quarter-phase 3 (Q3) and falls at the begin- 
ning of quarter-phase 1 (Q1). See Appendix C for device 
phase definitions. 


A second clock output signal. In this document (and on the 
TMS320C25), CLKOUTZ2 rises at the beginning of quar- 
ter-phase 2 (Q2) and falls at beginning of quarter-phase 4 
(Q4). See Appendix C for device phase definitions. 


Four 5-V supply pins, tied together externally. On the 
TMS32020, pin AG is also a supply pin. 


CLKOUT1 C11/58 | 
CLKOUT2 D10/57 Ls 
Vee 


A10/61 
B10/62 


H2/23 
L6/35 


Vss B1/10 Three ground pins, tied together externally. 
K11/44 
L2/27 
x1 G10/51 Output pin from the internal oscillator for the crystal. If a 
crystal is not used, this pin should be left unconnected. 
X2/CLKIN F11/52 Input pin to the internal oscillator from the crystal. If a 
crystal is not used, a clock may be input to the device on this 
pin. 


SERIAL PORT SIGNALS 
- 
— 


ic i 
ial taal Si 


Receive clock input. External clock signal for clocking data 
from the DR (data receive) pin into the RSR (serial port re- 
ceive shift register). Must be present during serial port 
transfers. 


Transmit clock input. External clock signal for clocking data 
from the XSR (serial port transmit shift register) to the DX 
(data transmit) pin. Must be present during serial port 
transfers. 


Serial data receive input. Serial data is received in the RSR 
(serial port receive shift register) via the DR pin. 


Serial data transmit output. Serial data transmitted from the 
XSR (seria! port transmit shift register) via the DX pin. 
Placed in high-impedance state when not transmitting. 


Frame synchronization pulse for receive input. The falling 
edge of the FSR pulse initiates the data-receive process, 
beginning the clocking of the RSR. 


DX 
FSR J2/25 
FSX 


= - 


T Input/Output/High-impedance state 


Frame synchronization pulse for transmit input/output. The 
falling edge of the FSX pulse initiates the data-transmit 
process, beginning the clocking of the XSR. Following re- 
set, the default operating condition of FSX is as an input. 
This pin may be selected by software to be an output when 
the TXM bit in the status register is set to 1. 
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The architectural design of the TMS320C2x (second-generation TMS320) 
emphasizes overall system speed, communication, and flexibility in processor 
configuration. Control signals and instructions provide block memory trans- 
fers, communication to slower off-chip devices, and multiprocessing imple- 
mentations. Increased throughput for many DSP applications is accomplished 
by single-cycle multiply/accumulate instructions, two large on-chip RAM 
blocks, eight auxiliary registers with a dedicated arithmetic unit, a serial port, 
hardware timer, faster 1/O for data-intensive signal processing, and other fea- 
tures. 


Major topics discussed in this section are listed below. 


° Architectural Overview (Section 3.1 on page 3-2) 
Functional Block Diagram (Section 3.2 on page 3-5) 
Internal Hardware Summary (Section 3.3 on page 3-7) 


Memory Organization (Section 3.4 on page 3-11) 
Data memory and program memory 
Memory maps and memory-mapped registers 
Auxiliary registers 
Memory addressing modes 
Memory-to-memory moves 


° Central Arithmetic Logic Unit (CALU) (Section 3.5 on page 3-22) 
Scaling shifter, ALU, and accumulator 
Multiplier, T and P registers 


© System Control (Section 3.6 on page 3-28) 
Program counter and stack 
Pipeline operation 
Reset 
Status registers 
Timer operation 
Repeat counter 
Powerdown mode 


°e External Memory and !/O Interface (Section 3.7 on page 3-39) 
Memory combinations 
Internal clock timing relationships 
External read and write cycles 
General-purpose I/O pins (B10 and XF) 


@ Interrupts (Section 3.8 on page 3-47) 
Interrupt operation 
External interrupt interface 


e Serial Port (Section 3.9 on page 3-51) 
Transmit and receive operations 
Timing and framing control 
Burst mode and continuous mode operation 


e@ Multiprocessing and Direct Memory Access (Section 3.10 on page 
3-63) 
Synchronization 
Global memory 
The hold function 
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3.1 Architectural Overview 
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The TMS320C2x high-performance digital signal processors, like the 
TMS320C1x devices, implement a Harvard-type architecture that maximizes 
processing power by maintaining two separate memory bus structures, pro- 
gram and data, for full-speed execution. Instructions are included to provide 
data transfers between the two spaces. Externally, the program and data 
memory can be multiplexed over the same bus so as to maximize the address 
range for both spaces while minimizing the pin count of the device. 


Increased flexibility in system design is provided by two large on-chip data 
RAM blocks (a total of 544 16-bit words), one of which is configurable either 
as program or data memory (see Figure 3-1). An off-chip 64K-word directly 
addressable data memory address space is included to facilitate implementa- 
tions of DSP algorithms. 


The large on-chip 4K-word masked ROM on the TMS320C25 can be used to 
cost-reduce systems, thus providing for a true single-chip DSP solution (see 
Figure 3-1). Programs of up to 4K words can be masked into the internal 
program ROM. The remainder of the 64K-word program memory space is lo- 
cated externally. Large programs can execute at full speed from this memory 
space. Programs may also be downloaded from slow external memory to on- 
chip RAM for full-speed operation. 


+5V GND 


DATA (16) 


INTERRUPTS 256-WORD | 288-WORD 
—_——————»—| JDATA/PROG |! DATA 
RAM RAM 


4K-WORDS ROM 
(TMS320C25) 
MULTIPLIER 


32-BIT ALU/ACC 


SHIFTERS 
TIMER 


Figure 3-1. TMS320C2x Simplified Block Diagram 


MULTIPROCESSOR 
INTERFACE 


SERIAL INTERFACE 


ADDRESS (16) 


The TMS320C2x performs two’s-complement arithmetic using the 32-bit ALU 
and accumulator. The ALU is a general-purpose arithmetic unit that operates 
using 16-bit words taken from data RAM or derived from immediate in- 
structions or using the 32-bit result of the multiplier’s product register. In 
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addition to the usual arithmetic instructions, the ALU can perform Boolean 
operations, providing the bit manipulation ability required of a high-speed 
controller. The accumulator stores the output from the ALU and is the second 
input to the ALU. The accumulator is 32 bits in length and is divided into a 
high-order word (bits 31 through 16) and a low-order word (bits 15 through 
0). Instructions are provided for storing the high- and low-order accumulator 
words in memory. 


The multiplier performs a 16 x 16-bit two’s-complement multiplication with a 
32-bit result in a single instruction cycle. The multiplier consists of three ele- 
ments: the T Register, P Register, and multiplier array. The 16-bit T Register 
temporarily stores the multiplicand; the P Register stores the 32-bit product. 
Multiplier values either come from data memory, from program memory when 
using the MAC/MACD instructions, or are derived immediately from the 
MPYK (multiply immediate) instruction word. The fast on-chip multiplier al- 
lows the device to efficiently perform fundamental DSP operations such as 
convolution, correlation, and filtering. 


The TMS320C2x scaling shifter has a 16-bit input connected to the data bus 
and a 32-bit output connected to the ALU. The scaling shifter produces a 
left-shift of O to 16 bits on the input data, as programmed in the instruction. 

_ The LSBs of the output are filled with zeros, and the MSBs may be either filled 
with zeros or sign-extended, depending upon the state of the sign-extension 
mode bit of status register ST1. Additional shift capabilities enable the pro- 
cessor to perform numerical scaling, bit extraction, extended arithmetic, and 
overflow prevention. 


The TMS320C2x local memory interface consists of a 16-bit parallel data bus 
(D15-D0), a 16-bit address bus (A15-A0), three pins for data/program me- 
mory or |/O space select (DS, PS, and IS), and various system control signals. 
The R/W signal controls the direction of a data transfer, and the STRB signal 
provides a timing signal to control the transfer. When using on-chip program 
RAM, ROM, or high-speed external program memory, the TMS320C2x runs 
at full speed without wait states. The use of a READY signal allows wait-state 
generation for communicating with slower off-chip memories. 


Up to eight levels of hardware stack are provided for saving the contents of 
the program counter during interrupts and subroutine calls. Instructions are 
available for saving the device’s complete context. PUSH and POP in- 
structions permit a level of nesting restricted only by the amount of available 
RAM. The interrupts used in these devices are maskable. 


Control operations are supported on the TMS320C2x by an on-chip memo- 
ry-mapped 16-bit timer, a repeat counter, three external maskable user inter- 
rupts, and interna! interrupts generated by serial port operations or by the 
timer. A built-in mechanism protects from those instructions that are repeated 
or become multicycle due to the READY signal and from holds and interrupts. 


An on-chip full-duplex serial port provides direct communication with serial 
devices such as codecs, serial A/D converters, and other serial systems. The 
interface signals are compatible with codecs and many other serial devices 
with a minimum of external hardware. The two serial port memory-mapped 
registers (the data transmit/receive registers) may be operated in either an 
8-bit byte or 16-bit word mode. Each register has an external clock input, a 
framing synchronization input, and associated shift registers. 
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Serial communication can be used between processors in multiprocessing 
applications. The TMS320C2x has the capability of allocating global data 
memory space and communicating with that space via the BR (bus request) 
and READY control signals. The 8-bit memory-mapped global memory allo- 
cation register (GREG) specifies up to 32K words of the TMS320C2x data 
memory as global external memory. The contents of the register determine the 
size of the global memory space. If the current instruction addresses an oper- 
and within that space, BR is asserted to request control of the bus. The length 
of the memory cycle is controlled by the READY line. 


The TMS320C2x supports Direct Memory Access (DMA) to its external 
program/data memory using the HOLD and HOLDA signals. Another processor 
can take complete contro! of the TMS320C2x external memory by asserting 
HOLD low. This causes the TMS320C2x to place its address, data, and control 
lines in the high-impedance state. Signaling between the external processor 
and the TMS320C2x can be performed using interrupts. On the TMS320C25, 
two modes are available: a TMS32020-like mode in which execution is sus- 
pended during assertion of HOLD, and a concurrent DMA mode in which the 
TMS320C25 continues to execute its program while operating from internal 
RAM or ROM, thus greatly increasing throughput in data-intensive applica- 
tions. 
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3.2 Functional Block Diagram 


The functional block diagram shown in Figure 3-2 outlines the principal 
blocks and data paths within the TMS320C2x processors. Further details of 
the functional blocks are provided in the succeeding sections. Refer to Section 
3.3, the internal hardware summary, for definitions of the symbols used in 
Figure 3-2. The block diagram also shows all of the TMS320C2x interface 
pins. Note that the shaded areas on the block diagram indicate enhancements 
provided on the TMS320C25. 


The TMS320C2x architecture is built around two major buses: the program 
bus and the data bus. The program bus carries the instruction code and im- 
mediate operands from program memory. The data bus interconnects various 
elements, such as the Central Arithmetic Logic Unit (CALU) and the auxiliary 
register file, to the data RAM. Together, the program and data buses can carry 
data from on-chip data RAM and internal or external program memory to the 
multiplier in a single cycle for multiply/accumulate operations. 


The TMS320C2x has a high degree of parallelism; e.g., while the data is being 
operated upon by the CALU, arithmetic operations may also be implemented 
in the Auxiliary Register Arithmetic Unit (ARAU). Such parallelism results in 
a powerful set of arithmetic, logic, and bit-manipulation operations that may 
all be performed in a single machine cycle. 
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Figure 3-2. TMS320C2x Block Diagram 
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3.3 Internal Hardware Summary 


The TMS320C2x internal hardware implements functions that other process- 
ors typically perform in software or microcode. For example, the device con- 
tains hardware for single-cycle 16 x 16-bit multiplication, data shifting, and 
address manipulation. This hardware-intensive approach provides computing 
power previously unavailable on a single chip. 


Table 3-1 presents a summary of the TMS320C2x internal hardware. This 
summary table, which includes the internal processing elements, registers, and 
buses, is alphabetized within each functional grouping. All of the symbols 
used in this table correspond to the symbols used in the block diagram of 
Section 3.2, the succeeding block diagrams in this section, and the text 
throughout this document. 
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Table 3-1. TMS320C2x Internal Hardware 


UNIT SYMBOL FUNCTION 


ACC (31-0) | A 32-bit accumulator split in two halves: ACCH (accu- 


ACCH(31-16)] mutator high) and ACCL (accumulator low). Used for 
Arithmetic Logic Unit 


ACCL(15-0) | storage of ALU output. 
U 


A 32-bit two’s-complement arithmetic logic unit having 
two 32-bit input ports and one 32-bit output port feeding 
the accumulator. 


A 16-bit unsigned arithmetic unit used to perform oper-_ 
ations on auxiliary register data. 


Auxiliary Register ARAU 
Arithmetic Unit 
Auxiliary Register File ARO-AR7 A register file containing five/eight 16-bit auxiliary 
(15-0) registers (ARO-AR7), used for addressing data memory, 
temporary storage, or integer arithmetic processing 
through the ARAU. 


Auxiliary Register File AFB(15-0) A 16-bit bus that carries data from the AR pointed to by 
Bus the ARP. 


Auxiliary Register Pointer | ARP(2-0) A 3-bit register used to select one of five/eight auxiliary 
registers. 


Auxiliary Register Pointer} ARB(2-0) A 3-bit register used to buffer the ARP. Each time the 
Buffer 


ARP is loaded, the old value is written to the ARB, except 
during an LST (load status register) instruction. When the 
ARB is loaded with an LST1, the same value is also copied 
into ARP. 


Central Arithmetic Logic CALU The grouping of the ALU, multiplier, accumulator, and 
Unit scaling shifter. 


D(15-0) A 16-bit bus used to route data. 


Data Memory Address DAB(15-0) 
Bus 


A 16-bit bus that carries the data memory address. 


A 9-bit register pointing to the address of the current 
page. Data pages are 128 words each, resulting in 512 
pages of addressable data memory space (some locations 
are reserved). 


A 16-bit bus that carries the ‘direct’ address for the data 


Data Memory Page DP(8-0) 
Pointer 
memory, which is the concatenation of the DP register 


Direct Data Memory DRB(15-0) 
Address Bus 
with the seven LSBs of the instruction. 
Global Memory GREG(7-0) | An 8-bit memory-mapped register for allocating the size 
Allocation Register of the global memory space. 
Instruction Register IR(15-0) A 16-bit register used to store the currently executing in- 
struction. 


Interrupt Flag Register {FR(5-0) A 6-bit flag register used to latch the active-low external 
user interrupts TNT (2-0) and the internal interrupts 
XINT/RINT (serial port transmit/receive) and TINT (timer) 
interrupts. The !FR is not accessible through software. 


A 6-bit memory-mapped register used to mask interrupts. 


Interrupt Mask Register IMR(5-0) 


tTMS320C25 only. 
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Table 3-1. TMS320C2x Internal Hardware (Continued) 


UNIT SYMBOL FUNCTION 


Microcall Stackt MCS (15-0) | A single-word stack that temporarily stores the contents 
of the PFC while the PFC is being used to address data 
memory with the block move (BLKD/BLKP), multiply-ac- 
cumulate (MAC/MACD), and table read/write 
(TBLR/TBLW) instructions. 


Multiplier MULT A 16 x 16-bit parallel multiplier. 
Period Register PRD (15-0) | A16-bit memory-mapped register used to reload the timer. 


Prefetch Countert PFC (15-0) | A 16-bit counter used to prefetch program instructions. 
The PFC contains the address of the instruction currently 
being prefetched. It is updated when a new prefetch is 
initiated. The PFC is also used to address data memory 
when using the block move (BLKD/BLKP), multiply-ac- 
cumulate (MAC/MACD), and_ table read/write 
(TBLR/TBLW) instructions. 


A 32-bit product register used to hold the multiplier pro- 
duct. The PR on the TMS320C25 can also be accessed as 
the most or least significant words using the SPH/SPL 
(store P register high/low) instructions. 


Program Bus P(15-0) A 16-bit bus used to route instructions (and data for the 
MAC and MACD instructions). 


Program Counter PC (15-0) A 16-bit program counter used to address program mem- 
ory. The PC always contains the address of the next in- 
struction to be executed. The PC contents are updated 
following each instruction decode operation. On the 
TMS32020, the operations of the TMS320C25 prefetch 
counter are performed by the program counter. 


Program Memory Address} PAB(15-0) | A 16-bit bus that carries the program memory address. 

Bus 

Queue Instruction OIR(15-0) A 16-bit register used to store prefetched instructions. 

Registert 

Random Access Memory RAM (BO) A RAM block with 256 x 16 locations configured either 

(data or program) as data or program memory. 

Random Access Memory RAM (B1) A data RAM block, organized as 256 x 16 locations. 

(data only) 

Random Access Memory RAM (B2) A data RAM block, organized as 32 x 16 locations. 

(data only) 

Repeat Counter RPTC (7-0) | An 8-bit counter to control the repeated execution of a 

single instruction. 

Serial Port Data DRR(15-0) { A 16-bit memory-mapped serial port data receive 

Receive Register register. Only the eight LSBs are used in the byte mode. 

Serial Port Data DXR(15-0) | A 16-bit memory-mapped serial port data transmit 

Transmit Register register. Only the eight LSBs are used in the byte mode. 
Serial Port Receive RSR(15-0) | A 16-bit register used to shift in serial port data from the 
Shift Registert RX pin. RSR contents are sent to the DRR after a serial 


transfer is compijeted. RSR is not directly accessible 
through software. 


Product Register PR(31-0) 


TTMS320C25 only. 
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Table 3-7? TMS320C2x Internal Hardware (Concluded) 
ginning of a Seria! port transmit operation. XSR is not di- 
rectly accessible through software. 

Shifters are located at the ALU input, the accumulator 
interrupts or calls. The ACCL and data memory values may 
also be pushed onto and popped from the stack. 

Status Registers STO,ST1 Two 16-bit status registers that contain status and 

(15-0) contro! bits. 


UNIT SYMBOL FUNCTION 
output, and the product register output. An in-place shifter 
Temporary Register TR(15-0) A 16-bit register that holds either an operand for the mul- 
‘tiplier or a shift code for the scaling shifter. 


Serial Port Transmit XSR(15-0) A 16-bit register used to shift out serial port data onto 
Shift Registert the DX pin. XSR contents are loaded from DXR at the be- 
3 is also located within the accumulator. 
Stack Stack(15-0) | A 4/8 x 16 hardware stack used to store the PC during 
TIM (15-0) | A 16-bit memory-mapped timer (counter) for timing con- 
trol. 
tTMS320C25 only. 
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3.4 Memory Organization 


3.4.1 Data 


The TMS320C2x provides a total of 544 16-bit words of on-chip data RAM, 
of which 288 words are always data memory and the remaining 256 words 
may be configured as either program or data memory. The TMS320C25 also 
provides 4K words of maskable program ROM. This section explains memory 
management using the on-chip data and program memory, memory maps, 
memory-mapped registers, auxiliary registers, memory addressing modes, and 
memory-to-memory moves. 


Memory | 


The 544 words of on-chip data RAM are divided into three separate blocks 
(BO, B1, and B2), as shown in Figure 3-3. Of the 544 words, 256 words 
(block BQ) are configurable as either data or program memory by instructions 
provided for that purpose; 288 words (blocks B1 and B2) are always data 
memory. A data memory size of 544 words allows the TMS320C2x to handle 
a data array of 512 words (256 words if on-chip RAM is used for program 
memory), while still leaving 32 locations for intermediate storage. See Section 
3.4.3 for memory map configurations. 


The TMS320C2x can address a total of 64K words of data memory. The on- 
chip data memory and internally reserved locations are mapped into the lower 
1K words of the data memory space. Data memory is directly expandable up 
to 64K words while still maintaining full-speed operation. A READY line is 
provided for interface to slower, less-expensive memories, such as DRAMs. 
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Figure 3-3. On-Chip Data Memory 


3.4.2 Program Memory 


On-chip program RAM, ROM, or high-speed external program memory can 
be used at full speed with no wait states. Alternatively, the READY line can 
interface the TMS320C2x to slower, less-expensive external memory. A total 
of 64K words of memory space is available. Internal RAM block BO can be 
configured as program memory using instructions for that purpose. Execution 
from this block can be initiated after the memory space has been reconfigured. 
See Section 3.7.1 for a description of instruction execution using various 
memory configurations. 


In addition, the TMS320C25 is equipped with 4K words of on-chip program 
ROM that can be mask-programmed at the factory with a customer’s program. 
The on-chip ROM allows program execution at full speed without the need 
for high-speed external program memory. The use of this memory also allows 
the external data bus to be freed for access of externa! data memory. 
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Mapping of the first 4K-word block of off-chip/on-chip program memory is 
user-selectable by means of the MP/MC (microprocessor/microcomputer) pin 
on the TMS320C25. This permits the designer to accelerate time-to-market 
with a TMS320C25-based product by using external ROM, and cost-reducing 
it later with the 4K internal ROM without any PC-board redesign. Setting 
MP/MC high maps in the block of off-chip memory; holding the pin low maps 
in the block of on-chip ROM. The XF (external flag) pin can be used to toggle 
the MP/MC pin to dynamically enable or disable the on-chip ROM. Note that 
care must be taken and instruction pipeline operation (see Section 3.6.2) un- 
derstood when using bank switching. 


The MP/MC pin on the TMS320C25 is a Vcc pin on the TMS32020. This 
allows substitution of a TMS320C25 for a TMS32020 since the TMS320C25 
automatically operates in the microprocessor mode and therefore is plug-in 
compatible in the system. See Section 2 for pinouts and signal descriptions. 


3.4.3 Memory Maps 


The TMS320C2x provides three separate address spaces for program memory, 

_ data memory, and I/O, as shown in Figure 3-4. These spaces are distin- 
guished externally by means of the PS, DS, and IS (program, data, and I/O 
space select) signals. The PS, DS, IS, and STRB signals are only active when 
external memory is being addressed. During an internal addressing cycle, these 
signals remain inactive high, thus preventing conflicts in memory addressing, 
e.g., when block BO is configured as program memory. 


The on-chip memory blocks BO, B1, and B2 are comprised of a total of 544 
words of RAM. Program/data RAM block BO (256 words) resides in pages 4 
and 5 of the data memory map when configured as data RAM and at ad- 
dresses >FFOO to >FFFF when configured as program RAM. Block B71 (al- 
ways data RAM) resides in pages 6 and 7, while block B2 resides in the upper 
32 words of page 0. Note that the remainder of page 0 is composed of the 
memory-mapped registers and internally reserved locations, and pages 1-3 of 
the data memory map consist of internally reserved locations. The internally 
reserved locations may not be used for storage, and their contents are unde- 
fined when read. See Section 3.4.4 for further information on the memory- 
mapped registers. 


The on-chip RAM is mapped into either the 64K-word data memory or pro- 
gram memory space, depending on the memory configuration (see Figure 
3-4). The CNFD/CNFP instructions are used to configure block BO as either 
data or program memory, respectively. The BLKP (block move from program 
memory to data memory) instruction may be used to download program in- 
formation to block BO when it is configured as data RAM. Then a CNFP 
(configure block as program memory) instruction may be used to convert it 
to program RAM (see the code example in Section 5.4.2). Regardless of the 
configuration, the user may still execute from external program memory. Note 
that when accessing internal program memory, external control lines remain 
inactive. 


Reset configures block BO as data RAM. Note that, due to internal pipelining, 
when the CNFD or CNFP instruction is used to remap RAM block BO, there 
is a delay before the new configuration becomes effective. This delay is one 
fetch cycle if execution is from internal program RAM. On the TMS32020, a 
delay of one fetch cycle occurs if execution is from external program memory. 
On the TMS320C25, there is a delay of two fetch cycles if execution is from 
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ROM or external program memory. This is particularly important if program 
execution is from the locations around >FFQO. Accordingly, a CNFP instruc- 
tion must be placed at location >FEFD in external memory if execution is to 
continue from the first location in block BO. If a CNFP is placed at location 
>FEFD, and the instruction at location >FEFF is a two-word instruction, the 
second word of the instruction will be fetched from the first location in block 
BO. If execution is from above location >FFOO and block BO is reconfigured, 
care must be taken to assure that execution resumes at the appropriate point 
in a new configuration. 


On-chip program ROM on the TMS320C25 is located in the lower 4K words 
of program memory when selected by setting MP/MC = 0. When MP/MC = 
1, the lower 4K words of program memory are external. 
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Figure 3-4. Memory Maps 
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3.4.4 Memory-Mapped Registers 


The six registers mapped into the data memory space are listed in Table 3-2 
and are shown in the block diagram of Figure 3-2. 


The memory-mapped registers may be accessed in the same manner as any 
other data memory location, with the exception that block moves using the 
BLKD (block move from data memory to data memory) instruction cannot be 
performed from the memory-mapped registers. 


Table 3-2. Memory-Mapped Registers 


REGISTER ADDRESS 
NAME LOCATION DEFINITION 


DRR(15-0) i¢) Serial port data receive register 
DXR(15-0) 1 Serial port data’transmit register 
TIM(15-0) Timer register 

PRD(15-0) Period register 

IMR (5-0) Interrupt mask register 


GREG(7-0) Global memory allocation register 


3.4.5 Auxiliary: Registers 


The TMS320C2x provides a register file containing up to eight auxiliary reg- 
isters (ARO-AR7). The TMS32020 has five auxiliary registers, and the 
TMS320C25 has eight. This section discusses each register’s function and 
how an auxiliary register is selected and stored. 


The auxiliary registers may be used for indirect addressing of data memory or 
for temporary data storage. Indirect auxiliary register addressing (see Figure 
3-5) allows placement of the data memory address of an instruction operand 
into one of the auxiliary registers. These registers are pointed to by a three-bit 
auxiliary register pointer (ARP) that is loaded with a value from 0 through 7, 
designating ARO through AR7, respectively. The auxiliary registers and the 
ARP may be loaded either from data memory or by an immediate operand de- 
fined in the instruction. The contents of these registers may also be stored in 
data memory. (Section 4 describes the programming of the indirect address- 
ing mode.) 
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Figure 3-5. Indirect Auxiliary Register Addressing Example 


The auxiliary register file (ARO-AR4 on the TMS32020 and ARO-AR7 on the 
TMS320C25) is connected to the Auxiliary Register Arithmetic Unit (ARAU), 
shown in Figure 3-6. The ARAU may autoindex the current auxiliary register 
while the data memory location is being addressed. Indexing by either +1 or 
by the contents of ARO may be performed. As a result, accessing tables of 
information does not require the Central Arithmetic Logic Unit (CALU) for 
address manipulation, thus freeing it for other operations. 
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Figure 3-6. Auxiliary Register File 


As shown in Figure 3-6, auxiliary register 0 (ARO) or the eight LSBs of the 
- instruction registers can be connected to one of the inputs of the ARAU. The 
other input is fed by the current AR (being pointed to by ARP). AR(ARP) re- 
fers to the contents of the current AR pointed to by ARP. The ARAU performs 


the following functions: 
AR(ARP) + ARO — AR(ARP) 


AR(ARP) - ARO 


) 


AR(ARP) 


AR(ARP) +1. — AR(ARP) 
AR(ARP) - 1 AR(ARP) 
AR(ARP) + AR(ARP) 


4 


Index the current AR by adding a 16-bit 
integer contained in ARO. 

Index the current AR by subtracting a 
16-bit integer contained in ARO. 
Increment the current AR by one. 
Decrement the current AR by one. 
AR(ARP) is unchanged. 
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In addition to the above functions, the ARAU on the TMS320C25 performs 
functions as follows: 


AR(ARP) + IR(7-0) 


1 


AR(ARP) Add 8-bit immediate value to the cur- 
rent AR. 

AR(ARP) Subtract 8-bit immediate value from 
the current AR. 

AR(ARP) | Bit-reversed indexing, add ARO with 
reverse-carry (rc) propagation (see 
Section 4.1.2). 

AR(ARP) | Bit-reversed indexing, subtract ARO 
with reverse-carry (rc) propagation 
(see Section 4.1.2). 


AR(ARP) - IR(7-0) 


4 


{ 


AR(ARP) + rcARO 


4 


AR(ARP) - rcARO 


Although the ARAU is useful for address manipulation in parallel with other 
operations, it may also serve as an additional general-purpose arithmetic unit 
since the auxiliary register file can directly communicate with data memory. 
The ARAU implements 16-bit unsigned arithmetic, whereas the CALU imple- 
ments 32-bit two’s-complement arithmetic. Instructions provide branches de- 
pendent on the comparison of the auxiliary register pointed to by ARP with 
ARO. The BANZ instruction permits the auxiliary registers to also be used as 
loop counters. - 


The three-bit auxiliary register pointer buffer (ARB), shown in Figure 3-6, 
provides storage for the ARP on subroutine calls and interrupts. 


3.4.6 Memory Addressing Modes 


The TMS320C2x can address a total of 64K words of program memory and 
64K words of data memory. The on-chip data memory is mapped into the 
64K-word data memory space. The on-chip ROM in the TMS320C25 is 
mapped into the program memory space when in the microcomputer mode. 
The memory maps, which change with the configuration of block BO, are de- 
scribed in detail in Section 3.4.4. 


The 16-bit data address bus (DAB) addresses data memory in one of the fol- 
lowing two ways: 


1) By the direct address bus (DRB) using the direct addressing mode (e.g., 
ADD >10), or 


2) By the auxiliary register file bus (AFB) using the indirect addressing 
mode (e.g., ADD *) 


Operands are also addressed by the contents of the program counter in the 
immediate addressing mode. 


Figure 3-7 illustrates operand addressing in the direct, indirect, and immediate 
addressing modes. 
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DIRECT ADDRESSING 


INSTRUCTION 


forcone | oma 


OPERAND 


INSTRUCTION 


INDIRECT ADDRESSING | OPCODE 


3 AR (ARP) 16 OPERAND 


INSTRUCTION 


IMMEDIATE OPERAND OPCODE |OPERAND) PC INSTRUCTION 


ea OPERAND 


Figure 3-7. Methods of Instruction Operand Addressing 


In the direct addressing mode, the 9-bit data memory page pointer (DP) 
points to one of 512 pages, each page consisting of 128 words. The data 
memory address (dma), specified by the seven LSBs of the instruction, points 
to the desired word within the page. The address on the direct address bus 
(DRB) is formed by concatenating the 9-bit DP with the 7-bit dma. 


In the indirect addressing mode, the currently selected 16-bit auxiliary register 
AR(ARP) addresses the data memory through the auxiliary register file bus 
(AFB). While the selected auxiliary register provides the data memory address 
and the data is being manipulated by the CALU, the contents of the auxiliary 
register may be manipulated thraugh the ARAU. See Figure 3-5 for an ex- 
ample of indirect auxiliary register addressing. The direct and indirect ad- 
dressing modes are described in detail in Section 4.1. 


When an immediate operand is used, it is either contained within the instruc- 
tion word itself or, in the case of 16-bit immediate operands, the word fol- 
lowing the instruction opcode. 


3.4.7 Memory-to-Memory Moves 
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The TMS320C2x provides instructions for data and program block moves and 
for data move functions that efficiently utilize the configurable on-chip RAM. 


The BLKD instruction moves a block within data memory, and the BLKP in- 
struction moves a block from program memory to data memory. When used 
with the repeat instructions (RPT/RPTK), the BLKD/BLKP instructions effi- 
ciently perform block moves from on- or off-chip memory. 


Implemented in on-chip RAM, the DMOV (data move) function on the 
TMS320C2x is equivalent to that of the TMS320C1x. DMOV allows a word 
to be copied from the currently addressed data memory location in on-chip 
RAM to the next higher location while the data from the addressed location 
is being operated upon in the same cycle (e.g., by the CALU). An ARAU 
operation may also be performed in the same cycle when using the indirect 
addressing mode. The DMOV function is useful for implementing algorithms 
that use the z™! delay operation, such as convolutions and digital filtering 
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where data is being passed through a time window. The data move function 
can be used anywhere within blocks BO, B1, or B2. It is continuous across the 
boundary of blocks BO and B1 but cannot be used with off-chip data memory. 
The MACD (multiply and accumulate with data move) and the LTD (load T 
register, accumulate previous product, and move data) instructions use the 
data move function. 


The TBLR/TBLW (table read/write) instructions allow words to be transferred 
between program and data spaces. TBLR !s used to read words from on-chip 
ROM or off-chip program ROM/RAM into the data RAM. TBLW is used to 
write words from on-chip data RAM to off-chip program RAM. 
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3.5 Central Arithmetic Logic Unit (CALU) 
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The TMS320C2x Central Arithmetic Logic Unit (CALU) contains a 16-bit 
scaling shifter, a 16 x 16-bit parallel! multiplier, a 32-bit Arithmetic Logic Unit 
(ALU), a 32-bit accumulator (ACC), and additional shifters at the outputs of 
both the accumulator and the multiplier. This section describes the CALU 
components and their functions. Figure 3-8 is a block diagram showing the 
components of the CALU. In the figure, note that SFL and SFR indicate shifts 
to the left or right, respectively. 


The following steps occur in the implementation of a typical ALU instruction: 


1) Data is fetched from the RAM on the data bus, 

2) Data is passed through the scaling shifter and the ALU where the arith- 
metic is performed, and 

3) The result is moved into the accumulator. 


One input to the ALU is always provided from the accumulator, and the other 
input may be transferred from the Product Register (PR) of the multiplier or 
from the scaling shifter that is loaded from data memory. 
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SCALING 
SHIFTER 


SFL(0-16) 


: TMS320C25 specific. 
Shifters on the TMS32020 of 0, 1, or 4. 


Figure 3-8. Central Arithmetic Logic Unit (CALU) 


3.5.1 Scaling Shifter 


The TMS320C2x provides a scaling shifter that has a 16-bit input connected 
to the data bus and a 32-bit output connected to the ALU (see Figure 3-8). 
The scaling shifter produces a left shift of O to 16 bits on the input data, as 
programmed in the instruction. The LSBs of the output are filled with zeros, 
and the MSBs may be either filled with zeros or sign-extended, depending 
upon the status programmed into the SXM (sign-extension mode) bit of sta- 
tus register ST1. 


The TMS320C2x also contains several other shifters, which allow it to perform 
numerical scaling, bit extraction, extended-precision arithmetic, and overflow 
prevention. These shifters are connected to the output of the multiplier and the 
accumulator. 
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3.5.2 ALU and Accumulator 
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The TMS320C2x 32-bit ALU and accumulator implement a wide range of 
arithmetic and logical functions, the majority of which execute in a single 
clock cycle. Once an operation is performed in the ALU, the result is trans- 
ferred to the accumulator where additional operations such as shifting may 
occur. Data that is input to the ALU may be scaled by the scaling shifter. 


The ALU is a general-purpose arithmetic unit that operates on 16-bit words 
taken from data RAM or derived from immediate instructions. In addition to 
the usual arithmetic instructions, the ALU can perform Boolean operations, 
providing the bit manipulation ability required of a high-speed controller. One 
input to the ALU is always provided from the accumulator, and the other input 
may be provided from the Product Register (PR) of the multiplier or the input 
scaling shifter that has fetched data from the RAM on the data bus. After the 
ALU has performed the arithmetic or logical operations, the result is stored in 
the accumulator. 


The 32-bit accumulator (see Figure 3-8) is split into two 16-bit segments for 
storage in data memory: ACCH (accumulator high) and ACCL (accumulator 
low). Shifters at the output of the accumulator provide a left-shift of 0 to 7 
places on the TMS320C25 and of 0, 1, or 4 places on the TMS32020. This 
shift is performed while the data is being transferred to the data bus for stor- 
age. The contents of the accumulator remain unchanged. When the ACCH 
data is shifted left, the LSBs are transferred from the ACCL, and the MSBs are 
lost. When ACCL is shifted left, the LSBs are zero-filled, and the MSBs are 
lost. 


The TMS320C2x supports floating-point operations for applications requiring 
a large dynamic range. The NORM (normalization) instruction is used to nor- 
malize fixed-point numbers contained in the accumulator by performing left 
shifts. The LACT (load accumulator with shift specified by the T register) in- 
struction denormalizes a floating-point number by arithmetically left-shifting 
the mantissa through the input scaling shifter. The shift count, in this case, is 
the value of the exponent specified by the four low-order bits of the T register 
(TR). ADDT and SUBT (add to/subtract from accumulator with shift speci- 
fied by the T register) instructions have also been provided to allow additional 
arithmetic operations. 


The accumulator overflow saturation mode may be programmed through the 
SOVM and ROVM (set/reset overflow mode) instructions. When the accu- 
mulator is in the overflow saturation mode and an overflow occurs, the over- 
flow flag is set and the accumulator is loaded with either the most positive or 
the most negative number depending upon the direction of overflow. The 
value of the accumulator upon saturation is >7FFFFFFF (positive) or 
>80000000 (negative). If the OVM (overflow mode) status register bit is reset 
and an overflow occurs, the overflowed results are loaded into the accumula- 
tor without modification. (Note that logical operations cannot result in over- 
flow.) 


The TMS320C2x can execute a variety of branch instructions that depend on 
the status of the ALU and accumulator. These instructions include the BV 
(branch on overflow) and BZ (branch on accumulator equal to zero). In ad- 
dition, the BACC (branch to address in accumulator) instruction provides the 
ability to branch to an address specified by the accumulator. Bit test in- 
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structions (BIT and BITT), which do not affect the accumulator, allow the 
testing of a specified bit of a word in data memory. 


The accumulator on the TMS320C25 also has an associated carry bit that is 
set or reset depending on various operations within the device. The carry bit 
allows more efficient computation of extended-precision products and addi- 
tions or subtractions. It is also useful in overflow management. The carry bit 
is affected by most arithmetic instructions as well as the shift and rotate in- 
structions. It is not affected by loading the accumulator, logical operations, or 
other such nonarithmetic or control instructions. It is also not affected by the 
multiply (MPY, MPYK, and MPYU) instructions, but is affected by the accu- 
mulation process in the MAC and MACD instructions. Examples of carry bit 
operation are shown in Figure 3-9. 


C MSB LSB C MSB LSB 

X FFFF FFFF ACC X 0000 0000 ACC 
+ 1 = 4 

1 0000 0000 0 FRFFF FFFF 

X YTFFF FFFF ACC X 86000 0000 ACC 
+ 1 (OVM=0) = 1 (OVM=0) 

0 F000 0000 1 7FFP FREE 

1 0000 0000 acc O FFFF FFFF ACC 


+ QO (ADDC 7 O (SUBB 
0 0000 00 0 1 INSTRUCTION) 1 FFFF FF FE INSTRUCTION) 


Figure 3-9. Examples of TMS320C25 Carry Bit Operation 


The value added to or subtracted from the accumulator, shown in the exam- 
ples of Figure 3-9, may come from either the input scaling shifter or the shifter 
at the output of the P register. The carry bit is set if the result of an addition 
or accumulation process generates a carry, or reset to zero if the result of a 
subtraction generates a borrow. Otherwise, it is reset after an addition or set 
after a subtraction. 


The ADDC (add to accumulator with carry) and SUBB (subtract from accu- 
mulator with borrow) instructions provided on the TMS320C25 use the pre- 
vious value of carry in their addition/subtraction operation (see these 
instructions in Section 4 for more detailed information). 


The one exception to operation of the carry bit, as shown in Figure 3-9, is in 
the use of the ADDH (add to high accumulator) and SUBH (subtract from 
high accumulator) instructions. The ADDH instruction can only set the carry 
bit if a carry is generated, and the SUBH instruction can only reset the carry 
bit if a borrow is generated; otherwise, neither instruction can affect it. 


Two branch instructions, BC and BNC, have been provided for branching on 
the status of the carry bit. The SC, RC, and LST1 instructions can also be used 
to load the carry bit. The carry bit is set to one on a hardware reset. 


The SFL and SFR (in-place one-bit shift to the left/right) instructions on the 
TMS320C2x and the ROL and ROR (rotate to the left/right) instructions on 
the TMS320C25 implement shifting or rotating of the contents of the accu- 
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mulator through the carry bit. The SXM bit affects the definition of the SFR 
(shift accumulator right) instruction. When SXM = 1, SFR performs an 
arithmetic right shift, maintaining the sign of the accumulator data. When 
SXM = 0, SFR performs a logical shift, shifting out the LSB and shifting in a 
zero for the MSB. The SFL (shift accumulator left) instruction is not affected 
by the SXM bit and behaves the same in both cases, shifting out the MSB and 
shifting in a zero. Repeat (RPT or RPTK) instructions may be used with the 
shift and rotate instructions for multiple shift counts. 


_ 3.5.3. Multiplier, T and P Registers 


The TMS320C2x utilizes a 16 x 16-bit hardware multiplier, which is capable 
of computing a signed or unsigned 32-bit product in a single machine cycle. 
All multiply instructions, except the MPYU (multiply unsigned) instruction on 
the TMS320C25, perform a signed multiply operation in the multiplier. That 
is, the two numbers being multiplied are treated as two’s-complement num- 
bers, and the result is a 32-bit two’s-complement number. As shown in Figure 
3-8, the following two registers are associated with the multiplier: 


e A 16-bit temporary register (TR) that holds one of the operands for the 
multiplier, and 


® A 32-bit product register (PR) that holds the product. 


The output of the product register can be left-shifted 1 or 4 bits. This is useful 
for implementing fractional arithmetic or justifying fractional products. The 
output of the PR can also be right-shifted 6 bits to enable the execution of 
up to 128 consecutive multiply/accumulates without the possibility of over- 
flow. 


An LT (load T register) instruction normally loads the TR to provide one op- 
erand (from the data bus), and the MPY (multiply) instruction provides the 
second operand (also from the data bus). A multiplication can also be per- 
formed with an immediate operand using the MPYK instruction. In either 
case, a product can be obtained every two cycles. 


Two multiply/accumulate instructions (MAC and MACD) fully utilize the 
computational bandwidth of the multiplier, allowing both operands to be 
processed simultaneously. The data for these operations may reside anywhere 
in internal or external memory, or can be transferred to the multiplier each cy- 
cle via the program and data buses. This provides for single-cycle 
multiply/accumulates. when used with repeat (RPT/RPTK) instructions. Note 
that the DMOV portion of the MACD instruction will not function with ex- 
ternal data memory addresses. On the TMS32020, the multiplier and multi- 
plicand must reside in separate on-chip RAM blocks. On the TMS320C25, the 
MAC and MACD instructions can be used with both operands in either inter- 
nal or external memory or one each in on-chip RAM. The SORA (square/add) 
and SORS (square/subtract) instructions pass the same value to both inputs 
of the multiplier for squaring a data memory value. 


The MPYU instruction on the TMS320C25 performs an unsigned multipli- 
cation, which greatly facilitates extended-precision arithmetic operations. The 
unsigned contents of the T register are multiplied by the unsigned contents 
of the addressed data memory location, with the result placed in the P register. 
This allows operands of greater than 16 bits to be broken down into 16-bit 
words and processed separately to generate products of greater than 32 bits. 
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After the multiplication of two 16-bit numbers, the 32-bit product is loaded 
into the 32-bit Product Register (PR) on the TMS320C2x. The product from 
the PR may be transferred to the ALU. 


Four product shift modes (PM) are available at the Product Register (PR) 
output, which are useful when performing multiply/accumulate operations, 
fractional arithmetic, or justifying fractional products. The PM field of status 
register ST1 specifies the PM shift mode, as shown in Table 3-3. 


Table 3-3. PM Shift Modes 


| IF PM Is: _| RESULT 


No shift 
Left shift of 1 bit 
Left shift of 4 bits 
Right shift of 6 bits 


Left shifts specified by the PM value are useful for implementing fractional 
arithmetic or justifying fractional products. For example, the product of either 
two normalized, 16-bit, two’s-complement numbers or two Q15 numbers 
contains two sign bits, one of which is redundant. Q15 format, one of the 
various types of QO format, is a number representation commonly used when 
performing operations on non-integer numbers (see Section 5.6.6 for an ex- 
planation and examples of Q15 representation). The single-bit left-shift 
eliminates this extra sign bit from the product when it is transferred to the ac- 
cumulator. This results in the accumulator contents being formatted in the 
same manner as the multiplicands. Similarly, the product of either a normal- 
ized, 16-bit, two’s-complement or Q15 number and a 13-bit, two’s- 
complement constant contains five sign bits, four of which are redundant. 
This is the case, for example, when using the MPYK instruction. Here the 
four-bit shift properly aligns the result as it is transferred to the accumulator. 


Use of the right-shift PM value allows the execution of up to 128 consecutive 
multiply/accumulate operations without the threat of an arithmetic overflow, 
thereby avoiding the overhead of overflow management. The shifter can be 
disabled to cause no shift in the product when working with integer or 32-bit 
precision operations. This allows compatibility with TMS320C1x code to be 
maintained. Note that the PM right shift is always sign-extended regardless 
of the state of SXM. 


The four least significant bits of the T register (TR) also define a variable shift 
through the scaling shifter for the LACT/ADDT/SUBT (load/add- 
to/subtract-from accumulator with shift specified by the TR) instructions. 
These instructions are useful in floating-point arithmetic where a number 
needs to be denormalized, i.e., floating-point to fixed-point conversion. The 
BITT (bit test) instruction allows testing of a single bit of a word in data 
memory based on the value contained in the four LSBs of the TR. 
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3.6 System Control 


System contro! on the TMS320C2x is provided by the program counter, 
hardware stack, PC-related hardware, the external reset signal, interrupts (see 
Section 3.8), the status registers, the on-chip timer, and the repeat counter. 
The following sections describe the function of each of these components in 
system control and pipeline operation. 


3.6.1 Program Counter and Stack 
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The TMS320C2x contains a 16-bit Program Counter (PC) and a hardware 
stack of four (TMS32020) or eight (TMS320C25) locations for PC storage 
(see Figure 3-10). The program counter addresses internal and external pro- 
gram memory in fetching instructions. The stack is used during interrupts and 
subroutines. 


TO PROGRAM 
ADDRESS BUS 


| TMs320C25 specific. 
Four-level stack provided on the TMS32020. 


Figure 3-10. Program Counter, Stack, and Related Hardware 


The program counter addresses program memory, either on-chip or off-chip, 
via the Program Address Bus (PAB). Through the PAB, an instruction is 
fetched from program memory and loaded into the Instruction Register (IR). 
When the IR is loaded, the PC is ready to start the next instruction fetch cycle. 
The PC may address on-chip RAM block BO when BO is configured as pro- 
gram memory, or the on-chip ROM provided on the TMS320C25. The PC 
also addresses off-chip program memory through the external address bus 
Ai5-A0 and the external data bus D15-D0. 


Architecture - System Control 


Data memory is addressed by the program counter during a BLKD instruction, 
which moves data blocks from one section of data memory to another. The 
contents of the accumulator may be loaded into the PC in order to implement 
“computed GOTO” operations. This can be accomplished using the BACC 
(branch to address in accumulator) or CALA (call subroutine indirect) in- 
structions. 


To start a new fetch cycle, the PC is loaded either with PC+1 or with a branch 
address (for instructions such as branches, calls, or interrupts). In the case of 
conditional branches where the branch is not taken, the PC is incremented 
once more beyond the location of the branch address. 


The TMS320C2x also has a feature, which allows the execution of the next 
single instruction N+1 times. N is defined by loading an 8-bit counter RPTC 
(repeat counter). If this repeat feature is used, the instruction is executed, and 
the RPTC is decremented until the RPTC goes to zero. This feature is useful 
with many instructions, such as NORM (normalize contents of accumulator), 
MACD (multiply and accumulate with data move), and SUBC (conditional 
subtract). When used with some multicycle instructions, such as MACD, the 
repeat features can result in these instructions effectively executing in a single 
cycle. 


The stack is 16 bits wide and four (TMS32020) or eight (TMS320C25) levels 
deep. The PC stack is accessible through the use of the PUSH and POP in- 
structions. Whenever the contents of the PC are pushed onto the top of the 
stack, the previous contents of each level are pushed down, and the bottom 
(fourth/eighth) location of the stack is lost. Therefore, data will be lost if more 
than four/eight successive pushes occur before a pop. The reverse happens 
on pop operations. Any pop after three/seven sequential pops yields the value 
at the bottom stack level. All of the stack levels then contain the same value. 
Two additional instructions, PSHD and POPD, push a data memory value 
onto the stack or pop a value from the stack to data memory. These in- 
structions allow a stack to be built in data memory for the nesting of 
subroutines/interrupts beyond four/eight levels. 


Note that on the TMS32020, the TBLR/TBLW, MAC/MACD, and 
BLKD/BLKP instructions use one level of the stack. The TMS320C25 contains 
a separate stack for use with these instructions, and no level of the PC is used. 


3.6.2 Pipeline Operation 


Instruction pipelining consists of the sequence of external bus operations that 
occurs during instruction execution. The prefetch-decode-execute pipeline is 
essentially invisible to the user, except in some cases where the pipeline must 
be broken (such as for branch instructions). In the operation of the pipeline, 
the prefetch, decode, and execute operations are independent, which allows 
instruction executions to overlap. Thus, during any given cycle, two or three 
different instructions can be active, each at a different stage of completion, 
resulting in the respective two-level pipeline on the TMS32020 or the three- 
level pipeline on the TMS320C25. 


The difference in pipeline levels does not necessarily affect instruction exe- 
cution speed, but merely changes the fetch/decode sequence. Most in- 
structions execute in the same number of cycles regardless of whether they 
are executed from internal RAM, ROM, or external program memory. The ef- 
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fects of pipelining are included in the instruction cycle timings for the 
TMS32020 and TMS320C25 listed in Appendix D. 


Additional PC-related hardware (see Figure 3-10) is provided on the 
TMS320C25 to allow three-level pipelining for higher performance. Included 
in the related hardware are the Prefetch Counter (PFC), the 16-bit MicroCall 
Stack (MCS) register, the Instruction Register (IR), and the Queue Instruction 
Register (QIR). 


In the three-level pipeline on the TMS320C25, the PFC contains the address 
of the next instruction to be prefetched. Once an instruction is prefetched, the 
instruction is loaded into the IR, unless the IR still contains an instruction 
currently executing, in which case the prefetched instruction is stored in the 
QIR. The PFC is then incremented, and after the current instruction has 
completed execution, the instruction in the QIR is loaded into the IR to be 
executed. 


The PC contains the address of the next instruction to be executed, and is not 
used directly in instruction fetch operations, but merely serves as a reference 
pointer to the current position within the program. The PC is incremented as 
each instruction is executed. When interrupts or subroutine call instructions 
occur, the contents of the PC are pushed onto the stack to preserve return 
linkage to the previous program context. 


The prefetch, decode, and execute operations of the pipeline are independent, 
thus allowing instruction executions to overlap. During any given cycle, three 
different instructions can be active, each at a different stage of completion. 
Figure 3-11 shows the operation of the three-level pipeline for single-word, 
single-cycle instructions executing from either internal program ROM or ex- 
ternal memory with no wait states. 


CLKOUT1 | | | | | | | 


prefetch <«—_—__N___-__N44___44-_ M2 ye — ? N+ : N+2 ; 
decode Gg IN ge NN N N+1 
execute ¢—_—__Ne2__y¢___No1 4g —__N gg 


Figure 3-11. Three-Level Pipeline Operation (TMS320C25) 


Pipelining is reduced to two levels when execution is from internal program 
RAM due to the fact that an instruction in internal RAM can be fetched and 
decoded in the same cycle. Thus, separate prefetch and decode operations are 
not required, as shown in Figure 3-12. 
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CLKOUT1 | | | | | | 


prefetch Ng NT gg Ng 
dencda “ N ea N+1 ae N+2 4 
execute «—_—Nel___4_N___yg _Ne1 gy 


Figure 3-12. Two-Level Pipeline Operation 


The following paragraphs describe, in detail, the operation of the TMS320C25 
pipeline. This description, in conjunction with Appendix D, gives sufficient 
information for predicting the operation of the TMS320C25 for hardware in- 
terface optimization, accurate program cycle counting, and simulation model- 
ling. Often it is not necessary to understand the intricate detail of the pipeline 
to design with the TMS320C25. Therefore, if the user is not specifically in- 
terested in these details, it is suggested that this description be skipped. 


The TMS320C25 executes most of its instructions in a single cycle, because 
all the instructions are straight decodes and highly pipelined as opposed to 
microcode. The basic pipeline operation is 3.25 cycles deep where the device 
sequence on any given cycle is fetching the third instruction, decoding the 
second instruction, and executing the first. Figure 3-13 shows the internal 
Operation of the TMS320C25 pipeline in reference to quarter phases 1 through 
4 (Q1-Q4). 
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Figure 3-13. TMS320C25 Standard Pipeline Operation 


The TMS320C25 machine cycle, externally referenced by the falling edges of 
the CLKOUT1 signal, consists of four internal cycles (or CLKIN cycles). This 
allows internal operations of the pipeline to execute as fast as 1/4 the machine 
cycle. The sequence of a general instruction execution in the pipeline is 
shown in Table 3-4. . 
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Table 3-4. Instruction Pipeline Sequence 


CYCLE | Q PHASE | OPERATION 


- New PC is output on address bus 


External read of instruction 
External read of instruction 
External read of instruction 


Instruction decode 
Instruction decode/ARAU execution 
On-chip RAM access/ARAU execution 


On-chip RAM access/load new AR value/update ARP 
ALU execution 
ALU execution 
Load accumulator 


Load status register 


When using an add instruction (e.g., ADD *+,12,AR4), the device fetches the 
instruction in cycle 1. During Q2 and Q3 of cycle 2, the instruction is de- 
coded. This includes the ALU command decode as well as generation of the 
data operand fetch address. In this case, the address comes from an auxiliary 
register. During 04 of cycle 2 and Q1 of cycle 3, the operand is fetched from 
the RAM location. The increment of the auxiliary register is performed during 
Q3 and QO4 of cycle 2, and the value is loaded into the auxiliary register in Q1 
of cycle 3. The ARP is also updated in Q1 of cycle 3. During Q2 and Q3 of 
cycle 3, the data is passed through the barrel shifter to execute the 12-bit 
left-shift, and the data is added by the ALU to the contents in the accumulator. 
In O4 of the third cycle, the ALU result is loaded into the accumulator. The 
status of the ALU operation is loaded into the status register in Q1 of the 
fourth cycle. The bits being loaded into the status register at this time consist 
of the current ALU status and the ARP associated with the next instruction. 


In the case of a store instruction (e.g., SACL *0-,3,AR2), the device operates 
the first two cycles in the same manner as the ADD instruction. In Q1 and 
Q2 of the third cycle, the data in the accumulator is passed through a barrel 
shifter, left-shifted 3 bits, and zero-filled. The lower 16 bits of the shifted 
value are written to the address specified by the current auxiliary register. 
During Q3 and Q4 of the third cycle, the index register (ARO) is added to the 
contents of the current auxiliary register and loaded back into the current 
auxiliary register in Q1 of the fourth phase. [n Q1 of the fourth cycle, the 
auxiliary register pointer is changed to AR2. There is no execution phase of 
this instruction. Figure 3-14 shows the ADD and SACL instructions operating 
back-to-back in a program sequence. It is assumed that both instructions re- 
side in external, zero wait-state memory and that the data resides in on-chip 
RAM. 
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CYCLE 1 | CYCLE 2 | CYCLE 3 | CYCLE 4 | CYCLE 5 


Figure 3-14. Pipeline Operation of ADD Followed by SACL 


When the device is reading instructions out of on-chip ROM, the basic inter- 
nal operation of the pipeline is the same. The only difference is that the con- 
trol lines (i.e., STRB, PS, and R/W) are inactive. If the device is fetching the 
instructions from on-chip RAM, the pipeline is shortened to 2.5 cycles since 
the device can fetch the instruction in half a cycle as opposed to the full cycle 
required in an external or on-chip ROM fetch. The instruction is fetched dur- 
ing Q4 and Q1, then decoded in Q2 and Q3. The rest of the pipeline tracks 
as described above. 


Some operations add additional machine cycles to the instruction execution 
without damaging the integrity of the program or hardware. External wait 
states, multiplexed data bus conflicts, two-word instructions, and program 
counter discontinuities are included in these operations, as described in the 
following paragraphs. 


Wait States. The TMS320C25 is designed to be interfaced to slower ex- 
ternal devices through the use of hardware-generated wait states. This applies 
to the program, data, and 1/O memory spaces of the Harvard architecture. 
Wait states are a direct delay on the instruction pipeline. Each wait state in- 
serted during the instruction fetch contributes an additional machine cycle in 
the pipeline execution of the instruction. In addition, any wait state incurred 
when accessing external data or I/O space also contributes an additional ma- 
chine cycle to the pipeline execution of the instruction. This factor applies to 
all instructions. Figure 3-15 describes how the pipeline reacts to wait states 
in external program memory. Note that the wait state added in cycle 2 results 
in a no-execution operation in cycle 4. 
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CYCLE 1 | CYCLE 2 | CYCLE 3 | CYCLE 4 | CYCLE 5 


Figure 3-15. Pipeline Operation with Wait States 


Multiplexed External Data Bus. The external data bus is multiplexed to 
support all three memory spaces of the TMS320C25. Therefore, external 
fetches to multiple spaces in the same instruction add additional machine cy- 
cles to the pipeline execution of the instruction. This is due to the fact that 
the external fetch takes a full cycle whereas the internal equivalent takes two 
quarter phases and can be included in the execution stage of the three-deep 
pipeline. Accessing the data memory space is controlled by setting of the data 
page pointer or the value contained in the auxiliary register used in any in- 
struction. Also affecting the pipeline in this manner is the access of the I/O 
bus or the tables in program memory (i.e., IN, OUT, TBLR, and TBLW). Figure 
3-16 shows how the pipeline processes an instruction with external program 


-and data access. 


3-35 


Architecture - System Control 


CYCLE 1 | CYCLE 2 | CYCLE 3 | CYCLE 4 | CYCLE 5 
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Figure 3-16. Pipeline with External Data Bus Conflict 


Two-Word Instructions. All two-word instructions take an additional cy- 
cle to fetch the 16-bit immediate operand following the instruction mnemonic. 
The first set of instructions for which this applies is the long immediate in- 
structions. The instruction mnemonic is followed by a 16-bit immediate op- 
erand to be executed upon in the ALU. The second set applies to those 
instructions that use the PFC register as a second data addressing unit on 
some optimized instructions, e.g., the multiply/accumulate and block move 
instructions (MAC, MACD, BLKP, and BLKD). In the second set, the extra 
cycle only appears once in a repeat loop. The third set involves conditional 
branches not taken. 
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Program Counter Discontinuities. Since the TMS320C25 is pipelined, a 
change (other than an increment) in the program counter requires that the 
pipeline be flushed. This applies to all branches, subroutine calls, software 
trap, interrupt traps, and return. The pipeline, being three deep, has the next 
instruction already loaded when the branch occurs. At this point, this instruc- 
tion will not affect any data or registers, so it is cleared from the pipeline. 
Therefore, two dead execution cycles are inserted while waiting for the pipe- 
line to reload. The device only takes one additional cycle if the destination of 
the branch is in on-chip RAM block 0. The pipeline is only two-deep in this 
case and only takes one cycle to reload. Figure 3-17 shows a branch from 
normal execution to an address in on-chip RAM, and Figure 3-18 shows an 
example of a return executed from on-chip RAM to a location in off-chip 
memory. 


CYCLE 1 | CYCLE 2 | CYCLE 3 | CYCLE 4 | CYCLE 5 


Figure 3 17. Pipeline Operation of Branch to On-Chip RAM 
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Figure 3-18. Pipeline Operation of RET from On-Chip RAM 


Interrupts are hardware-generated discontinuities to the sequential accessing 
of the program counter. The interrupt is executed based upon instruction ex- 
ecution complete, rather than memory operation complete. The instruction 
that is currently executing at the time of an interrupt executes completely. The 
interrupt traps following the completion of that instruction before the start of 
the execution of the next instruction. In this case, the repeated instruction is 
considered one execution; therefore, the repeat loop finishes before the inter- 
rupt trap is taken. This gives priority to the algorithm over the interrupt service. 
The interrupt operation in reference to the pipeline execution is illustrated in 
the data sheet timing diagrams (see Appendix A). Note that when interrupt 
vectors reside in external memory running with one wait state, there are two 
interrupt acknowledge (IACK) pulses. If this is a problem, the IACK line should 
be gated with READY. 
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Hardware Aspects of the Pipeline 


Viewing these effects on the pipeline at the hardware level requires additional 


explanation due to the lack of visibility of on-chip operations or optimization .- 


of the pipeline execution. The following paragraphs describe the effects of | 
HOLD/HOLDA, RS, interrupts, accumulator store, on-chip program access, ex- 
ternal data access, and repeats as they are visible from the pins of the device... 


In the cases of RS, interrupts, and HOLD/HOLDA, the effects on the pipeline - 


are shown in the data sheet timing diagrams (see Appendix A). 


Reset. The reset interrupt is a totally non-maskable interrupt. When exe- 
cuted, it stops operation of the pipeline and flushes the unexecuted parts. The. _ 
reset pulse must be at least three CLKOUT cycles wide. After the second 
CLKOUT cycle has completed (before the third rising edge of CLKOUT1), the 
device has brought all outputs into a high-impedance state. After the rising. 
edge of RS, the device begins to fetch the reset vector. Since the pipeline is. . 


empty, it does not execute the reset vector branch until two cycles later. If the: : 


HOLD line is brought low during the active reset, the device does not start the’ 
fetch of the reset vector until after the active HOLD is removed, and the device 
deactivates the HOLDA line. When HOLD is activated with RS to allow boot-. . 
loading of the code, the HOLDA line will go active low in three cycles, regard-_ 
less of whether or not the RS_ line has gone high. This is useful in that the 


HOLDA line can be used to enable the release of the RS line and guarantee the .- 


required three-cycle reset. 


s 


Interrupts. The effects of an interrupt become apparent on the hardware. * 
when a interrupt acknowledge (IACK) signal is valid on the rising edge of. 


CLKOUT2. This signifies the fetch of the first word of the interrupt vector. If. 
wait states are generated in the memory segment where the interrupt vector .... 


resides, an additional IACK pulse occurs for each wait state added. If this. 
causes a problem with the externa! interface, IACK can be gated with READY. 
to only accept the last interrupt acknowledge pulse. Note that the BIOZ in- 
struction tests the level of the BIO pin during the instruction fetch phase of the 
pipeline. = 


Hold/Hold Acknowledge. The hold operation, like that of interrupt, takes: ’ 
second priority to algorithm execution; therefore, the hold will not be ac-:. 

knowledged until after the currently running instruction is completed (a min-:. 
imum of three cycles). This includes repeated instructions. The next. 
instruction, after the final instruction executed before HOLDA, is latched into 
the pipeline and executed two cycles after the HOLDA line goes inactive high. 
The second instruction after the iast instruction executed is fetched two cycles. 


again after the HOLDA line goes inactive high. If the HM bit of status register .- 


ST1 is set high, the TMS320C2Z5 stops execution and sits idle until the hold 
is removed. This lowers power consumption by removing the drive of the 
memory address and control lines and also stopping major parts of the internal 
CPU circuits from switching and drawing power. This can be used as a 
hardware powerdown mode. If the HM bit is low, the TMS320C25 continues 
executing any instruction that can be executed with on-chip resources only. _ 
This means both program and data reside in on-chip memory. The device will - 


continue to operate normally unless an off-chip access is required by an in-) 


struction, at which time the processor adds wait states until the hold state is - 
removed. When running frori on-chip resources with HM = 0, the pyecessar, 
acknowledges HOLD with HOLDA during a multicycle instruction. 
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On-Chip Program Access. When executing from on-chip resources, the 
pipeline is visible only in the MSC line, which signals microstate complete 
when active low on the rising edge of CLKOUT2. Note that executing from 
on-chip program memory does not allow instruction accessing of external data 
memory to run in a single cycle. The normal operation of the instruction takes 
only two quarter phases of the execution cycle to fetch the on-chip data me- 
mory, whereas off-chip access requires all four quarter phases. The pipeline 
is, however, optimized to handle a repeated instruction that accesses external 
data memory with only one extra cycle for the first external fetch. 


External Program/Data Access. Visibility of the pipeline when using ex- 
ternal program and data memory requires a monitoring of the MSC, STRB, PS, 
and DS lines. The MSC line indicates at the rising edge of CLKOUT2 whether 
or not the cycle is the beginning of a new instruction fetch; i.e., MSC active 
low indicates the completion of an instruction and the acquisition of another 
instruction. The PS (program select) line indicates that the data bus is cur- 
rently being used to fetch an instruction. A step in the pipeline is not indicated 
since the PS line remains while the pipeline is fetching instructions externally. 
To track the fetches, the STRB line, which frames external accesses, must be 
monitored. 


The PS line being active low does not necessarily mean that the device is 
fetching an instruction. In the cases of table read/write (TBLR/TBLW), 
multiply/accumulate (MAC/MACD), and block transfer (BLKP) instructions, 
the device uses the PS line active low to access tables. 


To monitor external data memory fetches, the user should watch the data se- 
lect (DS) line in conjunction with the STRB line. An active low on the DS line 
indicates the data bus is currently being used to access data memory space. 
This line remains low for two memory fetches in the case of an accumulator 
store followed by an ALU instruction, both operating with off-chip memory. 
However, two STRB pulses will identify the individual access. Likewise, the 
line remains low for many cycles in the case of a repeated instruction. 1/O 
space access operates similarily to data space operation with the OUT and IN . 
instructions replacing the save and ALU instruction. 


A clear understanding of this information in conjunction with the data in Ap- 
pendix D of the TMS320C2x User's Guide should be sufficient to predict 
correctly the operation of the TMS320C25 pipeline. 
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3.6.3 Reset 


Reset (RS) is a non-maskable external interrupt that can be used at any time 
to put the TMS320C2x into a known state. Reset is typically applied after 
powerup when the machine is in a random state. 


Driving the RS signal low causes the TMS320C2x to terminate execution and 
forces the program counter to zero. RS affects various registers and status bits. 
At powerup, the state of the processor is undefined. For correct system op- 
eration after powerup, a reset signal must be asserted low for at least three 
clock cycles to guarantee a reset of the device (see Section 5.1 for other im- 
portant reset considerations). Processor execution begins at location 0, which 
normally contains a B (branch) statement to direct program execution to the 
system initialization routine (also see Section 5.1 for an initialization routine 
example). Section 6.1 provides system control circuitry design examples. 


Upon receiving an RS signal, the following actions take place: 


1) A logic O is loaded into the CNF (configuration control) bit in status 
register ST1, causing all RAM to be configured as data memory. 


2) The Program Counter (PC) is set to 0, and the address bus A15-A0 is 
driven with all zeroes while RS is low. 


3) The data bus D15-D0 is placed in the highs impedance state. 


4) All memory and 1/O space control signals (PS, DS, IS, R/W, STRB, and 
BR) are de-asserted by setting them to high levels while RS is low. 


5) All interrupts are disabled by setting the INTM (interrupt mode) bit to 
1. (Note that RS is non-maskable.) The interrupt flag register (IFR) is 
reset to all zeroes. 


6) Status bits: 
0 > OV and 1 > XF (TMS32020); in addition, on the TMS320C25, 
1 ~ SXM, 0 ~ PM, 1 7 HM, 0 > FO, 1 > C, and 1 > FSM. 
(The remaining status bits on the TMS320C2x are unchanged.) 


7) The global memory allocation register (GREG) is cleared to make all 
memory local. 


8) The RPTC (repeat counter) is cleared. 


9) The DX (data transmit) pin is placed in the high-impedance state. Any 
transmit/receive operations on the serial port are terminated, and the 
TXM (transmit mode) bit is reset to a low level. This configures the FSX 
framing pulse to be an input. A transmit/receive operation may be 
started by framing pulses only after the removal of RS. 


10) The TIM register is set to the maximum value (>FFFF) on reset for both 
the TMS32020 and TMS320C25. The PRD register on the TMS320C25 
is also initialized by reset to >FFFF. The TMS32020 requires a software 
initialization of the PRD register (see Example 5-1). The TIM register 
begins decrementing only after RS is de-asserted. 


11) The IACK (interrupt acknowledge) signal is generated in the same man- 
ner as a maskable interrupt. . 
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12) The state of the RAM is undefined following RS. 


13) The ARB, ARP, DP, IMR, OVM, and TC bits are not initialized by reset. 
Therefore, it is critical that these bits be initialized in software by the user 
following reset. 


Execution starts from location 0 of program memory when the RS signal is 
taken high. Note that if RS is asserted while in the hold mode, normal reset 
Operation occurs internally, but all buses and control lines remain in the 
high-impedance state. Upon release of HOLD and RS, execution starts from 
location zero. The TMS320C2x can be held in the reset state indefinitely. 


3.6.4 Status Registers 
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Two status registers, STO and ST1, contain the status of various conditions 
and modes. The status registers can be stored into data memory and loaded 
from data memory, thus allowing the status of the machine to be saved and 
restored for interrupts and subroutines. All status bits are written to and read 
from using LST/LST1 and SST/SST1 instructions, respectively (with the ex- 
ception of INTM, which cannot be loaded via an LST instruction). 


Figure 3-19 shows the organization of both status registers, indicating all 
status bits contained in each. Note that the DP, ARP, and ARB registers are 
shown as separate registers in the processor block diagram of Figure 3-2. 
Because these registers do not have separate instructions for storing them into 
RAM, they are included in the status registers. As shown in Figure 3-19, se- 
veral bits in the status registers are reserved and read as logic one’s by the LST 
and LST1 instructions. 


15 14 13 7 #66 5 4 3 2 1 0 


sto eS 


15 14 13 11. 10 


Ora the TMS32020, bits 5, 6, and 9 of ST1 are logic one’s. 


Figure 3-19. Status Register Organization 


Some additional instructions or functions may affect the status bits, as indi- 
cated in Table 3-5. 
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Table 3-5. Status Register Field Definitions 


FIELD FUNCTION 


Auxiliary Register Pointer Buffer. Whenever the ARP is loaded, the old 
ARP value is copied to the ARB except during an LST instruction. When 
the ARB is loaded via an LST1 instruction, the same value is also copied 
to the ARP. 


Auxiliary Register Pointer. This three-bit field selects the AR to be used in 
indirect addressing. When ARP is loaded, the old ARP value is copied to 
the ARB register. ARP may be modified by memory-reference instructions 
when using indirect addressing, and by the LARP, MAR, and LST in- 
structions. ARP is also loaded with the same value as ARB when an LST1 
instruction is executed. 


Carry bit. This bit is set to 1 if the result of an addition generates a carry, 
or reset to O if the.result of a subtraction generates a borrow. Otherwise, 
it is reset after an addition or set after a subtraction, except if the instruc- 
tion is ADDH or SUBH. ADDH can only set and SUBH only reset the carry 
bit, but cannot affect it otherwise. The shift and rotate instructions also 
affect this bit, as well as the SC, RC, and LST1 instructions. Two branch 
instructions, BC and BNC, have been provided to branch on the status of 
C. Cis set to 1 on a reset. 


On-Chip RAM Configuration Control bit. If set to 0, block BO is config- 
ured as data memory; otherwise, block BO is configured as program 
memory. The CNF may be modified by the CNFD, CNFP, and LST1 in- 
structions. RS resets the CNF to 0. 


Data Memory Page Pointer. The 9-bit DP register is concatenated with 
the 7 LSBs of an instruction word to form a direct memory address of 16 
bits. DP may be modified by the LST, LDP, and LDPK instructions. 


Format bit. When set to 0, the serial port registers are configured as 16-bit 
registers. When set to 1, the port registers are configured to receive and 
transmit eight-bit bytes. FO may be modified by the FORT and LST1 in- 
structions. FO is reset to 0. 


Frame Synchronization Mode bit. This bit indicates whether the serial port 
operates with or without frame sync pulses. When FSM = 1, the serial 
port operation is initiated following a frame sync pulse on the FSX/FSR 
inputs. When FSM = 0, the FSX/FSR inputs are ignored and the serial 
port operates continuously with no frame sync pulses required. The bit is 
set to 1 by a reset. 


Hold Mode bit. When HM = 1, the processor halts internal execution 
when acknowledging an active HOLD. When HM = 0, the processor may 
continue execution out of internal program memory but puts its external 
interface in a high-impedance state. This bit is set to 1 by a reset... 


Interrupt Mode bit. When set to 0, all unmasked interrupts are enabled. 
When set to 1, all maskable interrupts are disabled. INTM is set and reset 
by the DINT and EINT instructions, RS and IACK also set INTM. INTM 
has no effect on the unmaskable RS interrupt. Note that INTM is unaf- 
fected by the LST instruction. 


Overflow Flag bit. As a latched overflow signal, OV is set to 1 when ov- 
erflow occurs in the ALU. Once an overflow occurs, the OV remains set 
until a reset, BV, BNV, or LST instruction clears the OV. 


TTMS320C25 only. 
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Table 3-5. Status Register Field Definitions (Concluded) 


FUNCTION 


Overflow Mode bit. When set to 0, overflowed results overflow normally 
in the accumulator. When set to 1, the accumulator is set to either its most 
positive or negative value upon encountering an overflow. The SOVM and 
ROVM instructions set and reset this bit, respectively. LST may also be 
used to modify the OVM. 


Product Shift Mode. If these two bits are 00, the multiplier’s 32-bit prod- 
uct is loaded into the ALU with no shift. If PM = 01, the PR output is 
left-shifted one place and loaded into the ALU, with the LSBs zero-filled. 
If PM = 10, the PR output is left-shifted by four bits and loaded into the 
ALU, with the LSBs zero-filled. PM = 11 produces a right shift of six bits, 
sign-extended. Note that the PR contents remain unchanged. The shift 
takes place when transferring the contents of the PR to the ALU. PM is 
loaded by the SPM and LST1 instructions. The PM bits are cleared by 
RS. 


Sign-Extension Mode bit. SXM = 1 produces sign extension on data as it 
is passed into the accumulator through the scaling shifter. SXM = 0 sup- 
presses sign extension. SXM does not affect the definition of certain in- 
structions; e.g., the ADDS instruction suppresses sign extension 
regardless of SXM. This bit is set and reset by the SSXM and RSXM in- 
structions, and may also be loaded by LST1. SXM is set to 1 by RS. 


Test/Control Flag bit. The TC bit is affected by the BIT, BITT, CMPR, 
LST1, and NORM instructions. The TC bit is set to a 1 if a bit tested by | 
BIT or BITT is a 1, if a compare condition tested by CMPR exists between 
ARO and another AR pointed to by ARP, or if the exclusive-OR function 
of the two MSBs of the accumulator is true when tested by a NORM in-: 
struction. Two branch instructions, BBZ and BBNZ, provide branching on 
the status of the TC. 


Transmit Mode bit. TXM = 1 configures the serial port’s FSX pin to be an 
output. In this mode, a pulse is produced on FSX when DXR is loaded. 
Transmission then starts on the DX pin. TXM = 0 configures the FSX pin 
to be an input. TXM is set and reset by the STXM and RTXM instructions 
and may also be loaded by LST1. RS resets TXM to 0. 


XF pin status bit. This status bit indicates the state of the XF pin, a gen- 
eral-purpose output pin. XF is set and reset by the SXF and RXF in- 
structions or may be loaded by LST1. XF is set to 1 by RS. 


3.6.5 Timer Operation 
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The TMS320C2x provides a memory-mapped 16-bit timer (TIM) register and 
a 16-bit period (PRD) register, as shown in Figure 3-20. The on-chip timer 
is a down counter that is continuously clocked by CLKOUT1 on the 
TMS320C25. The timer on the TMS32020 is clocked by a signal whose fre- 
quency is CLKOUT1/4 or whose period is 4 x CLKOUT1 cycles. 


The TIM register is set to the maximum value (>FFFF) on reset for both the 
TMS32020 and TMS320C25. The PRD register on the TMS320C25 is also 
initialized by reset to >FFFF. The TMS32020 requires a software initialization 
of the PRD register (see Example 5-1). The TIM register begins decrementing 
only after RS is de-asserted. Following this, the TIM and PRD registers may 
be reloaded under program control. See Section 3.6.3 for reset information. 
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t The divide ratio where N = 4 on the TMS32020 and N = 1 on the TMS320C25. 


Figure 3-20. Timer Block Diagram 


The TIM register, data memory location 2, holds the current count of the timer. 
At every N x CLKOUT1 cycle where N = 4 on the TMS32020 and N = 1 on 
the TMS320C25, the TIM register is decremented by one. The PRD register, 
data memory location 3, holds the starting count for the timer. A timer inter- 
rupt (TINT) is generated every time the timer decrements to zero. The timer 
is reloaded with the value contained in the period (PRD) register within the 
next cycle after it reaches zero so that interrupts can be programmed to occur 
at regular intervals of (PRD + 1) cycles of CLKOUT1 on the TMS320C25 or 
(4 x PRD) cycles of CLKOUT1 on the TMS32020. This feature is useful for 
control operations and for synchronously sampling or writing to peripherals. 
By programming the PRD register from 1 to 65,535 (>FFFF), a TINT can be 
_generated every 2 to 65,536 cycles on the TMS320C25. Note that, on the 
TMS32020, a TINT can be generated every 4 to 262,140 cycles. A PRD reg- 
ister value of zero is not allowed. 


The timer and period registers can be read from or written to on any cycle. The 
count can be monitored by reading the TIM register. Anew counter period can 
be written to the period register without disturbing the current timer count. 
The timer will then start the new period after the current count is complete. If 
both the PRD and TIM registers are loaded with a new period, the timer begins 
decrementing the new period without generating an interrupt. Thus, the pro- 
grammer has complete control of the current and next periods of the timer. 


If the timer is not used, TINT should be masked or all maskable interrupts 
disabled by a DINT instruction. The PRD register can then be used as a gen- 
eral-purpose data memory location. If TINT is used, the PRD and TIM registers 
should be programmed before unmasking the TINT. 


3-45 


Architecture - System Control 


3.6.6 Repeat Counter 


The repeat counter (RPTC) is an 8-bit counter, which when loaded with a 
number N, causes the next single instruction to be executed N + 1 times. The 
RPTC can be loaded with a number from 0 to 255 using either the RPT (re- 
peat) or RPTK (repeat immediate) instructions. This results in a maximum of 
256 executions of a given instruction. RPTC is cleared by reset. 


The repeat feature can be used with instructions such as multiply/accumulates 
(MAC/MACD), block moves (BLKD/BLKP), I/O transfers (IN/OUT), and ta- 
ble read/writes (TBLR/TBLW). These instructions, which are normally multi- 
cycle, are pipelined when using the repeat feature, and effectively become 
single-cycle instructions. For example, the table read instruction may take 
three or more cycles to execute, but when repeated, a table location can be 
read every cycle. Note that not all instructions can be repeated (see Section 
4.3 and Appendix D for more information). 


3.6.7 Powerdown Mode (TMS320C25) 
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When operated in the powerdown mode, the TMS320C25 enters a dormant 
state and requires approximately one-half the power normally needed to sup- 
ply the device (see the data sheet, Appendix A). Powerdown mode is invoked 
either by executing an IDLE instruction or by driving the HOLD input low while 
the HM status bit is set to one. 


While in powerdown mode, all of the internal contents of the TMS320C25 are 
maintained to allow operation to continue unaltered when powerdown mode 
is terminated. During that time, the data and address lines and the contro! 
signals (PS, DS, IS, STRB, and R/W) are all maintained in the high-impedance 
state. Powerdown mode is terminated upon receipt of an interrupt when an 
IDLE instruction is being executed or by removal of the HOLD input. (Refer to 
the IDLE instruction description in Section 4 and the hold function description 
in Section 3.10.3 for further information.) 
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3.7 External Memory and I/O Interface 


The TMS320C2x supports a wide range of system interfacing requirements. 
Data, program, and I/O address spaces provide interfacing to memory and I/O, 
thus maximizing system throughput. The local memory interface consists of: 


@ A 16-bit parallel data bus (D15-D0), 

® A 16-bit address bus (A15-A0), 

° Data, program, and !/O space select (DS, PS, and IS) signals, and 
e Various system control signals. 


ine R/W (read/write) signal controls the direction of the transfer, and STRB 
(strobe) provides a timing signal to control the transfer. 


The TMS320C2x I/O space consists of 16 input and 16 output ports. These 
ports provide the full 16-bit parallel 1/O interface via the data bus on the de- 
vice. A single input or output operation, using the IN or OUT instructions, 
typically takes two cycles; however, when used with the repeat counter, the 
operation becomes single-cycle. 


1/O design is simplified by having 1/O treated the same way as memory. I/O 
devices are mapped into the I/O address space using the processor's external 
address and data buses in the same manner as memory-mapped devices. 
When addressing internal memory, the data bus must be in the high-impe- 
dance state and the control signals go to an inactive state (logic high). Refer 
to Section 5 for the effect instructions have on 1|/O. 


Interfacing to memory and 1/O devices of varying speeds is accomplished by 
using the READY line. When communicating with slower devices, the 
TMS320C2x processor waits until the other device completes its function, 
signals the processor via the READY line, and continues execution (see Sec- 
tion 6). 


3.7.1 Memory Combinations 


The exact sequence of operations performed as instructions execute depends 
on the areas in memory where the instructions and operands are located. There 
are six possible combinations of program and data memory since information 
can be located in either internal RAM, external memory, or internal ROM 
(available only on the TMS320C25). The six possible combinations are: 


@ Program Internal RAM/Data Internal (P1/D1) 

Program Internal RAM/Data External (PI/DE) 

Program External/Data Internal (PE/D]) 

Program External/Data External (PE/DE) 

Program Internal ROM/Data Internal (PR/DI) on the TMS320C25 
Program Internal ROM/Data External (PR/DE) on the TMS320C25. 


Appendix D provides cycle timings for instructions both when repeated and 
when not repeated. The following is a summary of program execution, organ- 
ized according to memory configuration. 
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PI/DI or PR/DI When both program and data memory are on-chip, 
the processor runs at full speed with no wait 
states. Note that IN and OUT instructions have 
different cycle timings when program memory is 
internal; IN requires two cycles to execute whereas 
OUT requires only one cycle. 


PE/DI ‘This memory mode can run at full speed if external 
program memory is sufficiently fast since internal 
data operations can occur coincident with external 
program memory accesses. If external program 
memory is not fast enough, wait states may be 
generated using the READY input. 


PI/DE, PE/DE, or PR/DE Additional cycles are required to execute in- 
structions that reference an external data memory 
space. At least two cycles are required to execute 
‘read from external data memory’ instructions such 
as ADD, LAR, etc. Further additional cycles may 
be required due to wait states if external data 
memory is not fast enough to be accessed within 
a single cycle. Note, however, that the 
TMS320C25 has the capability of executing ‘write 
to external data memory’ instructions in a single 
cycle when program memory is internal (two cy- 
cles are required if program memory is also ex- 
ternal). Additional cycles are also required in this 
case if external data memory is not sufficiently fast. 


in all memory configurations where the same bus is used to communicate with 
external data, program, or 1/O space, the number of cycles required to execute 
a particular instruction may further vary depending on whether the next in- 
struction fetch is from internal or external program memory. Instruction exe- 
cution and operation of the pipeline are discussed in Section 3.6.2 and in the 
succeeding subsections. 


3.7.2 Internal Clock Timing Relationships 
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The crystal or external clock source frequency is divided to produce an internal 
four-phase clock. The four phases are defined by CLKOUT1 and CLKOUT2, 
as shown in Figure 3-21. In this document (and on the TMS320C25), the 
start of quarter-phase 3 (Q3) is defined as the rising edge of CLKOUT1. Refer 
to Appendix C for device phase definitions. 
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Figure 3-21. Four-Phase Clock 


3.7.3. General-Purpose !/O Pins (BIO and XF) 


The TMS320C2x has two general-purpose pins that are software-controlled. 
The BIO pin is a branch control input pin, and the XF pin is an external flag 
output pin. 


The BIO pin is useful for monitoring peripheral deviee status. It is especially 
useful as an alternative to using an interrupt when it ts necessary not to disturb 
time-critical loops. When the BIO input pin is active (low), execution of the 
BIOZ instruction causes a branch to occur. 


In Figure 3-22, BiO is sampled at the end of Q4 (Q2 on the TMS32020). The 
timing diagram shown is for a sequence of single-cycle, single-word in- 
structions without branches located in external memory. Because of variations 
in pipelining due to instructions prior to and following the BIOZ instruction, 
this timing may vary. Therefore, it is recommended that several cycles of setup 
be provided if BIO is to be recognized on a particular cycle. 


3-49 


Architecture - External Memory and |/O Interface 


3-50 


CLKOUT1 a a en oe i ee / \ 


CLKOUT2 
STRB 
A15-AO BX VALID x 
: (BRANCH: (NEXT (NEXT INSTRUCTION) 
(BI0Z) =: ADDRESS) : INSTRUCTION) N+3 OR BRANCH 
fetch: N N+ : N+2 .¢ ADDRESS _,. 


Figure 3-22. BIO Timing Diagram 


The XF (external flag) output pin is set to a high level by the SXF (set external 
flag) instruction and reset to a low level by the RXF (reset external flag) in- 
struction. XF is set high by RS. 


The relationship between the time the SXF/RXF instruction is fetched before 
the XF pin is set or reset is shown in Figure 3-23. As with Bi6, the timing 
shown for XF is for a sequence of single-cycle, single-word instructions lo- 
cated in external memory. Actual timing may vary with different instruction 
sequences. 
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NOTES: 1. N is the program memory location for the current instruction. 
2. This example only shows the execution of single-cycle instructions 
fetched from external program memory. 


1 N +2 : N +3 : 


Figure 3-23. External Flag Timing Diagram 
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3.8 Interrupts 


3.8.1 
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The TMS320C2x has three external maskable user interrupts (INT2-INTO), 
available for external devices that interrupt the processor. Internal interrupts 
are generated by the serial port (RINT and XINT), by the timer (TINT), and 
by the software interrupt (TRAP) instruction. Interrupts are prioritized with 
reset (RS) having the highest priority and the serial port transmit interrupt 
(XINT) having the lowest priority. 


Interrupt Operation 


This subsection details interrupt organization and management. Vector lo- 
cations and priorities for all internal and external interrupts are shown in Table 
3-6. The TRAP instruction, used for software interrupts, is not prioritized but 
is included here since it has its own vector location. Each interrupt address 
has been spaced apart by two locations so that branch instructions can be 
accommodated in those locations if desired. 


Table 3-6. Interrupt Locations and Priorities 


INTERRUPT MEMORY 
ail LOCATION PRIORITY FUNCTION 


1 (highest) External reset signal 
External user interrupt #0 
External user interrupt #1 


External user interrupt #2 
Reserved locations 
Internal timer interrupt 
Serial port receive interrupt 
7 (lowest) Serial port transmit interrupt 
N/A TRAP instruction address 


When an interrupt occurs, it is stored in the 6-bit Interrupt Flag Register (IFR). 
This register is set by the external user interrupts INT(2-O0) and the internal 
interrupts RINT, XINT, and TINT. Each interrupt is stored in the JFR until it is 
recognized, and then automatically cleared by the IACK (interrupt acknowl- 
edge) signal or the RS (reset) signal. The RS_ signal is not stored in the IFR. 
No instructions are provided for reading from or writing to the IFR. 


The TMS320C2x has a memory-mapped Interrupt Mask Register (IMR) for 
masking external and internal interrupts. The layout of the register is shown 
in Figure 3-24. A 1 in bit positions 5 through O of the IMR enables the cor- 
responding interrupt, provided that INTM = 0. The IMR is accessible with 
both read and write operations but cannot be read using BLKD. When the 
IMR is read, the unused bits (15 through 6) are read as one’s. The lower six 
bits are used to write to or read from the IMR. Note that RS is not included 
in the IMR, and therefore the IMR has no effect on reset. 
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Figure 3-24. Interrupt Mask Register (IMR) 


The INTM (interrupt mode) bit, which is bit 9 of status register STO, enables 
or disables all maskable interrupts. INTM = 0 enables all the unmasked in- 
terrupts, and INTM = 1 disables these interrupts. The INTM is set to 1 by the 
TACK (interrupt acknowledge) signal, the DINT instruction, or a reset. This bit 
is reset to O by the EINT instruction. Note that the INTM does not actually 
modify the IMR or IFR. 


The TMS320C2x has a built-in mechanism for protecting multicycle in- 
structions from interrupts. If an interrupt occurs during a multicycle instruc- 
tion, the interrupt is not processed untiJ the instruction is completed. This 
mechanism also applies to instructions that become multicycle due to the 
READY signal. 


In addition, the device does not allow interrupts to be processed when an in- 
struction is being repeated via the RPT or RPTK instructions. The interrupt is 
stored in the IFR until the repeat counter (RPTC) decrements to zero, and then 
the interrupt is processed. Even if the interrupt is de-asserted while the 
TMS320C2x is processing the RPT or RPTK, the interrupt will still be latched 
by IFR and pending until RPTC decrements to zero. 


If both the HOLD line and an interrupt go active during a multicycle instruction 
or a repeat loop, the HOLD takes control of the processor at the end of the in- 
struction or loop. When HOLD is released, the interrupt is acknowledged. 


Interrupts cannot be processed between EINT and the next instruction in a 
program sequence. For example, if an interrupt occurs during an EINT in- 
struction execution, the device always completes EINT as well as the following 
instruction before the pending interrupt is processed. This insures that a RET 
can be executed before the next interrupt is processed, assuming that a RET 
instruction follows the EINT. The state of the machine, upon receiving an in- 
terrupt, may be saved and restored (see Section 5.3.1). 


3.8.2 External Interrupt Interface 


Interrupts may be asynchronously edge- or level-triggered. In the functional 
logic organization for INT(2-0), shown in Figure 3-25, the external interrupt 
INTO is connected to an edge-triggered flip-flop. The INTO signal is ORed with 
the interrupt edge flip-flop Q output and synchronized with internal quarter- 
phases 1 and 2 to produce an interrupt signal (see Appendix B for phase re- 
lationships on the TMS32020). In this way, the device can handle both 
edge-triggered and level-triggered interrupts. 


Due to the level sensitivity of the external interrupts and the synchronization 
of the interrupts (first on Q2, then on Q1 of the following machine cycle), it 
is necessary to bring the INT line inactive high at least two cycles before ena- 
bling interrupts (EINT). If this criteria is not met, the TMS320C25 will imme- 
diately take the interrupt trap following the EINT plus the next instruction. 
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If the INTM bit and flag register have been properly enabled, the interrupt 
signal is accepted by the processor. An IACK (interrupt acknowledge) signal 


is then generated. The IACK clears the appropriate interrupt edge flip-flop and 
disables the INTM latch. The logic is the same for INT1 and INT2. 
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Figure 3-25. Internal Interrupt Logic Diagram 


In a typical interrupt (INT2-INTO) operation, the interrupt is generated by a 
negative-going edge and the IFR bit is set. Since INTM is disabled when the 
interrupt is acknowledged, the level may continue to be present on the INT 
input without generating further interrupts. If the level is removed before an 
EINT instruction is executed, no further interrupts are generated. If a low level 
continues to be present after the EINT, another interrupt is generated after the 
EINT/next instruction sequence. !n addition, if the INT pin is pulsed between 
the previous IACK and EINT, another interrupt is generated after EINT/RET, 
because the corresponding IFR bit is again set. 


Figure 3-26 shows an interrupt, interrupt acknowledge, and various other 
signals for the special case of single-cycle instructions. An interrupt generated 
during the current (N) fetch cycle still allows the fetch and execution of that 
instruction. The N+1 and N+2 instructions are also fetched, then discarded, 
and the address N+1 is pushed onto the top of the stack. The instruction is 
fetched again upon a return command from the interrupt routine. 
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Two dummy execute cycles occur on an interrupt, as shown in the timing di- 
agram for the TMS320C25 (Figure 3-26). The IACK signal is asserted low. 
during CLKOUT1 low when the device initiates a fetch from interrupt location 
|. Therefore, an external device can determine the interrupt that occurred by 
latching the address bus value present on A4-A1 with the rising edge of 
CLKOUT2 when IACK is low. 


CLKOUT1 


CLKOUT2 


STRB 


INT2-INTO 


A15-AO0 


fatch 


execute 


IACK 


NOTES: 1. N is the program memory location for the current instruction. 
2. Lis the interrupt vector location in program memory for the active interrupt. 
3. For simplicity, this example only shows the execution of single-cycle instructions 
fetched from external program memory, rather than multicycle instructions. 


Figure 3-26. Interrupt Timing Diagram (TMS320C25) 
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3.9 Serial Port 
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A full-duplex on-chip serial port provides direct communication with serial 
devices such as codecs, serial A/D converters, and other serial systems. The 
interface signals are compatible with codecs and many other serial devices 
with a minimum of external hardware. The serial port may also be used for 
intercommunication between processors in multiprocessing applications. 


Both receive and transmit operations are double-buffered on the TMS320C25, 

‘thus allowing a continuous bit stream even if FSX is an output. The use of the 
frame sync mode (FSM) bit provides continuous operation that once initiated 
requires no further frame synchronization pulses. No minimum CLKR/CLKX 
frequency (fmin = 0-Hz) is required for serial port operation. 


The bits, pins, and registers that control serial port operation are listed in Table 
3-7. Availability of a function on a particular device is also indicated. 


Table 3-7. Serial Port Bits, Pins, and Registers 


SERIAL PORT BITS/PINS/REGISTERS TMS32020 | TMS320C25 


FO Format bit Yes Yes 
Transmit mode bit Yes Yes 
Frame synchronization mode bit No Yes 


Transmit clock signal 
CLKR_ Receive clock signal 
DX Transmitted serial data signal 

DR Received serial data signal 

Transmit framing synchronization signal 
Receive framing synchronization signal 


Data transmit register 
DRR Data receive register 

Transmit shift register 
Receive shift register 


The serial port uses two memory-mapped registers: the data transmit register 
(DXR) that holds the data to be transmitted by the serial port, and the data 
receive register (DRR) that holds the received data (see Figure 3-27). Both 
registers operate in either the 8-bit byte mode or 16-bit word mode, and may 
be accessed in the same manner as any other data memory location. Each 
register has an external clock, a framing synchronization pulse, and associated 
shift registers. Any instruction accessing data memory can be used to read 
from or write to these registers; however, the BLKD (block move from data 
memory to data memory) instruction cannot be used to read these registers. 
The DXR and DRR registers are mapped into locations 0 and 1 in the data 
address space. The XSR and RSR registers are not directly accessible through 
software. 
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Figure 3-27. The DRR and DXR Registers 


If the serial port is not being used, the DXR and DRR registers can be used 
as general-purpose registers. In this case, the CLKR or FSR should be con- 
nected to a logic Jjow to prevent a possible receive operation from being initi- 
ated. 


Three bits in status register ST1 are used to control the serial port operation: 
FO, TXM, and FSM. The FO (format) bit defines whether data to be trans- 
mitted and received is an 8-bit byte or a 16-bit word. If FO = 0, the data is 
formatted in 16-bit words. If FO = 1, the data is formatted in 8-bit bytes. In 
the 8-bit mode, only the eight least-significant bits are used for 
transmit/receive operations. The FO bit is loaded by the FORT (format serial 
port registers) instruction. On reset, FO is set to 0. 


The TXM (transmit mode) bit is used to determine if the frame synchronization 
pulse for the transmit operation is generated externally or internally. If TXM 
= 1, the FSX pin becomes an output pin, and a framing pulse is produced on 
the FSX pin every time the DXR register is loaded. This framing pulse is syn- 
chronized with the rising edge of CLKX. If TXM = 0, the FSX pin becomes 
an input pin. The TMS320C2x then waits for an external synchronization 
pulse before beginning transmission. On a reset, TXM is set to zero, config- 
uring FSX to be an input. The TXM bit can be loaded by the LST1, STXM, 
or RTXM instructions. If DXR on the TMS32020 is loaded before the previous 
word is completely sent, the serial port immediately begins transmitting the 
new word. The bits of the previous word that have not been sent are Jost. If 
TXM = 1 on the TMS32020, a new FSX pulse is generated. If TXM = 0, the 
serial port immediately begins transmitting the new word without waiting for 
a new external FSX pulse. 


The FSM (frame synchronization mode) status register bit is used to select 
whether frame sync pulses are required for each serial port transfer. If FSM = 
1, frame sync pulses are required; if FSM = 0, they are not required. FSM is 
set by the SFSM (set frame synchronization mode) instruction and cleared by 
the RFSM (reset frame synchronization mode) instruction. When FSM = 1 
and frame sync pulses are required, an FSX pulse will cause the XSR to be 
loaded with data from the DXR, and transmission will begin. If an FSX is pre- 
sented prior to the last bit of the current transmission, the XSR will be reloaded 
from the DXR, thus aborting the current transmission and immediately begin- 
ning a new one. 


The frame sync mode is useful in communicating to ‘PCM highways.’ For ATT 
T1 and CCITT G711/712 lines, the processor can communicate directly in 
these formats by counting the transmitted/received bytes in software and 
performing SFSM/RFSM instructions as needed to set/reset the FSM bit. 
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3.9.1 Transmit and Receive Operations 


The transmit and receive sections of the serial port are implemented separately 
to allow independent transmit and receive operations. Externally, the serial 
port interface is implemented using the six serial port pins. Figure 3-28 shows 
the registers and pins used in transmit and receive operations. Note that on the 
TMS32020, the DXR and XSR are combined as a single register, as well as the 
DRR and RSR. 
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Figure 3-28. Serial Port Block Diagram 


The data on the DX and DR pins is clocked out of or into the XSR or RSR on 
the TMS320C25 by the CLKX or CLKR signal, respectively. On the 
TMS32020, the data on the pins is clocked directly out of the DXR or into the 
DRR. CLKX and CLKR are only required to be present during actual serial port 
transfers, and may be stopped when no data is being transferred. Data bits 
can be transferred in either 8-bit bytes or 16-bit words. Data is clocked out to 
DXR on the rising edges of CLKX, while data is clocked in from DRR on the 
falling edges of CLKR. The MSB of the data is transferred first. 


The XSR and RSR are connected to the DXR and DRR, respectively. For 
transmit operations, the contents of DXR are transferred to XSR when a new 
transmission begins. For a receive operation, the contents of RSR are trans- 
ferred to DRR when all of the bits have been received. Thus, the serial port is 
double-buffered since data may be transferred to or from the DXR or DRR 
while another transmit or receive operation is being performed. 
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Serial port transfers on the TMS320C25 are generally initiated by a frame sync 
pulse. The exception to this is when the continuous mode of operation is used 
with FSM = 0, as described in a subsequent paragraph. Frame sync pulses are 
input on FSX for transmit operations and on FSR for receive operations. 


The transmit timing diagram is shown in Figure 3-29. The transmit operation 
begins when data is written into the data transmit register (DXR). The 
TMS320C2x begins transmitting data when the frame synchronization pulse 
(FSX) goes low while CLKX is high or going high. The data, starting with the 
MSB, is then shifted out via the DX pin with the rising edge of CLKX. When 
all bits have been transmitted, an internal transmit interrupt (XINT) is gener- 
ated on the falling edge of CLKX. When the serial port is not transmitting, 
DX is placed in the high-impedance state. 


DX and FSX are unaffected by assertion of the HOLD input. Upon assertion 
of HOLD, any seria! port transmission in progress on the DX pin is completed 
before DX is placed in the high-impedance state. FSX remains configured as 
either an input or output, remaining low if it is an output. 


CLKX 
FSX 
(TXM=1) 


DX 


‘4 or 16 BITS¢+—___+»__> 


XINT ; - 


Figure 3-29. Serial Port Transmit Timing Diagram 


The receive operation is similar to the transmit operation. The receive timing 
diagram is shown in Figure 3-30. Reception is initiated by a frame synchro- 
nization pulse on the FSR pin. After FSR goes low, data on the DR pin is 
clocked into the RSR register on the TMS320C25 (DRR register on the 
TMS32020) on every negative-going edge of CLKR. The first data bit is 
considered the MSB, and RSR is filled accordingly. After all the bits have 
been received, (as specified by FO), an internal receive interrupt (RINT) is 
generated on the falling edge of CLKR and the contents of RSR are transferred 
to DRR. !f DRR is read before an RINT is received, the bits that have not been 
clocked in yet at the time of the read will contain the data from a previous 
transfer. To prevent a possible overrun of the DRR register, the DRR must be 
read before the next FSR. 
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Figure 3-30. Serial Port Receive Timing Diagram 


3.9.2 Timing and Framing Control 
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Upon completion of a serial port transfer, an internal interrupt is generated. 
The RINT interrupt is generated for a receive operation, and XINT is generated 
for a transmit operation. RINT and XINT are generated on the rising edge of 
CLKR and CLKX, respectively, after the last bit is transferred. Note that if DRR 
is read before a RINT is received, it will contain the data from the previous 
operation. Similarly, if DXR is loaded more than once after an XINT is gener- 
ated (in the continuous transmission mode), only the last value written will 
be loaded into XSR for the next transmit operation. 


When the TMS320C2x is reset, TXM is cleared to zero, and DX is placed in a 
high-impedance state. Any transmit or receive operation that is in progress 
when the reset occurs is terminated. 


The transmit framing synchronization pulse can be generated internally or ex- 
ternally. The maximum speed of the serial port is 5 MHz. The timing of the 
serial port signals is compatible with the TI/Intel 29C1x series codecs. The 
timing is also compatible with the AMI S3506 series codecs if the frame syn- 
chronization signals are inverted. 


Serial port transfers on the TMS320C25 are generally initiated by a frame sync 
pulse, except when the continuous mode of operation is used with FSM = 0. 
Frame sync pulses are input on FSX for transmit operations and on FSR for 
receive operations. If FSM = 1, frame sync pulses are required; if FSM = 0, 
they are not required. FSM is set by the SFSM (set frame synchronization 
mode) instruction and cleared by the RFSM (reset frame synchronization 
mode) instruction. 
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3.9.3  Burst-Mode Operation 


in burst-mode serial port operation, transfers are separated in time by periods 
of no serial port activity (the serial port does not operate continuously). For 
burst-mode operation, FSM must be set to one. Timing of the serial port in 
this mode of operation is shown in Figure 3-31 and Figure 3-32. 


CLKX 


FSX 
(TXM=1) 


DX 
(FO=1) 


XINT 


DXR 
LOADED RELOADED 


XSR 


XSR 
LOADED RELOADED 
(DURING CLKX LOW) 


Figure 3-31. Burst-Mode Serial Port Transmit Operation 
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Figure 3-32. Burst-Mode Serial Port Receive Operation 


When TXM = 1 (in the transmit operation) and the serial port register DXR is 
loaded, a framing pulse is generated on the next rising edge of CLKX. XSR is 
loaded with the current contents of DXR while FSX is high and CLKX is low. 
Transmission begins when FSX goes low while CLKX is high or is going high. 
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Figure 3-31 shows the timing for the byte mode (FO = 1). XINT is generated 
on the rising edge of CLKX after all 8 or 16 bits have been transmitted and 
DX is placed in the high-impedance state. If DXR is reloaded before the next 
rising edge of CLKX after XINT, FSX will again be generated as shown, and 
XSR will be reloaded. 


The receive operation is similar to the transmit operation. The contents of RSR 
are loaded into DRR while CLKR is low, just after reception of the last bit sent 
by the transmitting device (see Figure 3-32). RINT is generated on the next 
rising edge of CLKR, and DRR may be read at any time before the reception 
of the final bit of the next transmission. When operating in the byte mode, the 
eight MSBs of the DRR are the contents of the eight LSBs of the DRR prior 
to reception of the current byte, as shown in Figure 3-33 for the TMS320C25. 
On the TMS32020, the most-significant byte is unaffected by successive 8-bit 
receive operations. 
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Figure 3-33. Byte-Mode DRR Operation (TMS320C25) 


3.9.4 Continuous Operation Using Frame Sync Pulses (TMS320C25) 
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The TMS320C25 provides two modes of operation that allow the use of a 
continuous stream of serial data. When FSM = 1, frame sync pulses are re- 
quired. Since DXR is double-buffered, continuous operation is achieved even 
if TXM = 1. Writing to DXR during a serial port transmission does not abort 
the transmission in progress, but instead DXR stores that data until XSR can 
be reloaded. As long as DXR is reloaded before the CLKX rising edge on the 
final bit being transmitted, the FSX pulse will go high on the rising edge of 
CLKX during the transmission of the final bit and fail on the next rising edge 
when transmission of the word just loaded begins. !f DXR is not reloaded 
within this period and FSM = 1, the DX pin will be placed in a high-impe- 
dance state for at least one CLKX cycle until DXR is reloaded (as described in 
the previous section). Figure 3-34 and Figure 3-35 show the timing diagrams 
for the continuous operation with frame sync pulses. 
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Figure 3-34. Serial Port Transmit Continuous Operation (FSM = 1) 
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Figure 3-35. Serial Port Receive Continuous Operation (FSM = 1) 


Continuous receive operation with FSM = 1 is identical to that of burst-mode 
operation with the exception that FSR is pulsed during reception of the final 
bit. 
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3.9.5 Continuous Operation Without Frame Sync Pulses (TMS320C25) 
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The continuous mode of operation on the TMS320C25 allows transmission 
and reception of a continuous bit stream without requiring frame sync pulses 
every 8 or 16 bits. This mode is selected by setting FSM = 0. 


Figure 3-36 and Figure 3-37 show operation of the serial port for both states 
of FSM to illustrate differences in operation for each case. FSM is initially set 
to one, and frame sync pulses are required to initiate serial transfers. Before the 
completion of the transmission (i.e., before the next serial port interrupt), the 
FSM but must be reset to zero by means of an RFSM (reset FSM) instruction. 
RFSM can occur either before or after the write to DXR or read from DRR. 
From this point on, the FSX and FSR inputs are ignored, with transmission 
occurring every CLKX cycle and reception occurring every CLKR cycle as long 
as those clocks are present. 


If FSX is configured as an output, it will remain low until FSM is set back to 
one and DXR is reloaded. If DXR is not reloaded with new data every XINT 
(every 8 or 16 CLKX cycles depending on FO), the last value loaded will be 
transmitted on DX continuously. Note that this is different from the case with 
FSM = 1 where DX is placed into a high-impedance state if DXR is not re- 
loaded before transmission of the last bit of the current word in XSR. For ex- 
ample, if byte C is not loaded into DXR as indicated in Figure 3-36, bits of 
byte B (B1-B8) will be retransmitted instead of bits of byte C as shown. 


For receive operations, DRR is loaded from RSR (and an RINT is generated) 
every 8 or 16 CLKR cycles (depending on FO), regardless of whether or not 
DRR has been read. An overrun of DRR is also possible with FSM = 1 if DRR 
is not read before the next RINT. The only way to stop continuous trans- 
mission or reception once started, when FSM = 0, is to either stop CLKX or 
CLKR or to perform an SFSM (set FSM) instruction. 


Continuous transmission without frame sync pulses is very useful in commu- 
nicating directly to telephone system PCM highways. For ATT T1 and CCITT 
G711/712 lines, FSX and FSR pulses are generated only every 24 or 32 bytes. 
By counting the transmitted and received bytes in software after an initial FSX 
or FSR.and performing SFSM and RFSM instructions as required, the 
TMS320C25 can easily be made to communicate in these formats. 
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3.9.6 Initialization of Continuous Operation Without Frame Sync Pulses 
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(TMS320C25) 


FSM is normally initialized during an XINT or RINT service routine to enable 
or disable FSX and FSR, respectively, for the next serial port operation. It is 
necessary to start this mode with FSM = 1 so that the first data transferred 
out of the serial port is the data written to the DXR register. Otherwise, the 
serial port starts transmitting the contents of the shift register before loading 
it with the value stored in the DXR register. Upon each completion of a data 
packet transmission, it loads the data contained in the.DXR register into the 
shift register and continues transmitting. After the first frame pulse has been 
generated by or sent to the TMS320C25, the FSM bit must be reset to 0 using 
the RFSM instruction. This must be done before the next serial port interrupt 
to assure continuous transmission. If continuous transmission is stopped via 
software, this initiation sequence must be repeated to restart the continuous 
mode operation. . 


As shown in Figure 3-38 and Figure 3-39, RFSM may occur before a write to 
DXR, regardless of the state of TXM. If TXM = 1, FSX is generated in a normal 
manner on the next rising edge of CLKX, but only once. If TXM = 0, the 
TMS320C25 waits to transmit until FSX is pulsed, but from then on, the FSX 
input is ignored. Note that just as in the case of continuous-mode operation 
without sync pulses described in Section 3.9.5, the first data written to DXR 
(byte A) is output twice unless DXR is reloaded before the second trans- 
mission is started. It is important to consider this dummy cycle when using 
continuous-mode serial operation. 


The receive timings are the same as those for the transmit operations with 
TXM = 0. The TMS320C25 waits to receive data until FSR is pulsed, but 
thereafter the FSR input is ignored. No dummy cycle is associated with the 


‘receive operation due to the post-buffering nature of DRR as opposed to the 


prebuffering nature of DXR. 
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3.10 Multiprocessing and Direct Memory Access (DMA) 


The flexibility of the TMS320C2x allows configurations to satisfy a wide range 
of system requirements. Some of the system configurations using the 
TMS320C2x are as follows: 


A standalone system (single processor), 
A multiprocessor with devices in parallel, 
A host/slave multiprocessor with shared global data memory space, or 


A peripheral processor interfaced using processor-controlled signals to 
another device. 


These system configurations are made possible by three specialized features 
of the TMS320C2x: the synchronization function utilizing the SYNC input, the 
global memory interface, and the hold function implemented with the HOLD 
and HOLDA pins. The following sections describe these functions in detail. 


' 3.10.1 Synchronization 
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In a multiprocessor environment, the SYNC input can be used to greatly ease 
interface between processors. This input is used to cause each TMS320C2x 
in the system to synchronize their internal clocks, thereby allowing the pro- 
cessors to run in lock-step operation. 


Multiple TMS320C2x devices are synchronized by using common SYNC and 
external clock inputs. A negative transition on SYNC sets each processor to 
internal quarter-phase one (Q1). This transition must occur synchronously 
with the rising edge of CLKIN. On the TMS320C25, there is a two CLKIN 
cycle delay following the cycle in which SYNC goes low, before the synchro- 
nized Q1 occurs. On the TMS32020, there is no delay. 


The timing diagrams for the SYNC input are shown in Figure 3-40 and Figure 
3-41 for the TMS32020 and TMS320C25, respectively. Note that the internal 
clock timing relationships are different in the TMS32020 and TMS320C25 
(see Appendix C and Section 3.7.2). 


CLKIN 


SYNC 


CLKOUT1 


CLKOUT2 


Figure 3-40. Synchronization Timing Diagram (TMS32020) 
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Figure 3-41. Synchronization Timing Diagram (TMS320C25) 


Normally, SYNC is applied while RS is active. If SYNC is asserted after a reset, 
the following can occur: 


1) | The processor machine cycle is reset to Q1, provided that the timing re- 
quirements for SYNC are met. If SYNC is asserted at the beginning of Q1, 
Q3, or 04, the current instruction is improperly executed. If SYNC is as- 
serted at the beginning of Q2, the current instruction is executed prop- 
erly. 


2) If SYNC does not meet the timing requirements, unpredictable processor 
operation occurs. A reset should then be executed to place the processor 
back in a known state. 


3.10.2 Global Memory 


For multiprocessing applications, the TMS320C2x has the capability of allo- 
cating global data memory space and communicating with that space via the 
BR (bus request) and READY control signals. 


Global memory is memory shared by more than one processor; therefore, ac- 
cess to it must be arbitrated. When using global memory, the processor’s ad- 
dress space is divided into local and global sections. The local section is used 
by the processor to perform its individual function, and the global section is 
used to communicate with other processors. 


A memory-mapped global memory allocation register (GREG) specifies part 

of the TMS320C2x’s data memory as global external memory. GREG, which 

is memory-mapped at data memory address location 5, is an eight-bit register 
_ connected to the eight LSBs of the internal D bus. The upper eight bits of lo- 
- cation 5 are nonexistent and read as one’s. 


The contents of GREG determine the size of the global memory space. The 
legal values of GREG and corresponding global memory spaces are shown in 
Table 3-8. Note that values other than those listed in the table lead to frag- 
mented memory maps. 
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Table 3-8. Global Data Memory Configurations 


LOCAL MEMORY GLOBAL MEMORY 
GREG VALUE RANGE # WORDS RANGE # WORDS 


O00000XX >0 - >FFFF 0 
10000000 >0 - >7FFF >FFFF 32,768 
11000000 >0 - >BFFF >FFFF 16,384. 


11100000 >0 - >DFFF >FFFF 8,192 
11110000 >0 - >EFFF >FFFF 4,096 
11111000 >0 - >F7FF >FFFF 2,048 
11111100 >0 - >FBFF >FFFF 1,024 
11111110 >0 - >FDFF > FEEF 512 
41111111 >0 - >FEFF >FFOO - >FFFF 256 


When a data memory address, either direct or indirect, corresponds to a global 
data memory address (as defined by GREG), BR is asserted low with DS to 
indicate that the processor wishes to make a global memory access. External 
logic then arbitrates for control of the global memory, asserting READY when 
the TMS320C2x has control. The length of the memory cycle is controlled by 
the READY line. One wait-state timing is shown in Figure 3-42. Note that all 
signals not shown have the same timing as in the normal read or write case. 


CLKOUT1 


STRB 


A15-A0 


Figure 3-42. Global Memory Access Timing 
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3.10.3 The Hold Function 


The TMS320C2x supports Direct Memory Access (DMA) to its local (off- 
chip) program, data, and 1/O spaces. Two signals, HOLD and HOLDA, are 
provided to allow another device to take control of the processor’s buses. 
Upon receiving a HOLD signal from an external device, the processor ac- 
knowledges by bringing HOLDA low. The processor then places its address 
and ‘data buses as well as all control signals (PS, DS, IS, R/W, and STRB) in the 
high-impedance state. The serial port output pins, DX and FSX, are not af- 
fected by HOLD. Signaling between the external processor and the 
TMS320C2x can be performed using interrupts. 


The timing for the HOLD and HOLDA signals is shown in Figure 3-43. HOLD 
has the same setup time as READY and is sampled at the beginning of quar- 
ter-phase 3 (see Appendix C for phase relationships on the TMS32020). If 
the setup time is met, it takes three machine cycles before the buses and 
contro! signals go to the high-impedance state. Note that unlike the external 
interrupts INT(2-0), HOLD is not a latched input. The external device must keep 
HOLD low until it receives a HOLDA from the TMS320C2x. 


lf the TMS320C2x is in the middle of a multicycle instruction, it will finish the 
instruction before entering the hold state. After the instruction is completed, 
the buses are placed in the high-impedance state. This also applies to in- 
structions that become multicycle due to insertion of wait states or to the use 
of RPT/RPTK instructions. 


After HOLD is de-asserted, program execution resumes from the same point at 
which it was halted. HOLDA is removed synchronously with HOLD, as shown 
in Figure 3-43. If the setup time is met, two machine cycles are required be- 
fore the buses and control signals become valid. 


HOLD Is not treated as an interrupt. If the TMS320C2x was executing the IDLE 
instruction before entering the hold state, it resumes executing IDLE once it 
leaves the hold state. 


The hold function on the TMS320C25 has two distinct operating modes: 


C) A TMS32020-like mode, in which execution is suspended during as- 
~ sertion of HOLD, and 

© A TMS320C25 concurrent DMA mode, in which the TMS320C25 con- 

tinues to execute its proaram while operating from internal RAM or 

ROM, thus greatly increasing throughput in data-intensive applications. 


The operating mode is selected by the HM (hold mode) status register bit on 
the TMS320C25. The HOLD signal is pulled low, as shown in the first part of 
Figure 3-43. When HM = 1, the TMS320C25 halts program execution and 
enters the hold state directly. When HM = 0, the processor enters the hold 
state directly, as shown in Figure 3-43, if program execution is from external 
memory or if external data memory is being ‘accessed. If program execution is 
from internal memory, however, and if no external data memory accesses are 
required, the processor enters the hold state externally, but program execution 
continues internally. This alisws more efficient system operation since a pro- 
gram may continue executiiig while an external DMA operation is being per- 
formed. 
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Program execution ceases until HOLD is removed if the processor is in a hold 
state with HM = O and an internally executing program requires an external 
access, or if the program branches to an external address. Also, if a repeat 
instruction that requires the use of the external bus is executing with HM = 0 
and a hold occurs, the hold state is entered after the current bus cycle. If this 
situation occurs with HM = 1, the hold state will not be entered until the re- 
peat count is completed. HM is set and reset by the SHM (set hold mode) 
and RHM (reset hold mode) instructions, respectively. 


All interrupts are disabled while HOLD is active with HM = 1. If an interrupt 
is received during this period, the interrupt is latched and remains pending. 
HOLD itself does not affect any interrupt flags or registers. If HM = 0, inter- 
rupts function normally. 


CLKOUT1 


STRB 


HOLD 


fetch 


execute 


HOLDA 


NOTES: 1. Nis the program memory location for the current instruction. 
2. This example only shows the execution of single-cycle instructions 
fetched from external program memory. 


Figure 3-43. TMS320C25 Hold Timing Diagram 
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Figure 3-43. TMS320C25 Hold Timing Diagram (Concluded) 
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4. Assembly Language Instructions 


The TMS320C2x instruction set supports numeric-intensive signal processing 
operations as well as general-purpose applications, such as multiprocessing 
and high-speed control. TMS320C1x source code is upward-compatible with 
TMS320C2x source code. TMS32020 object code is upward-compatible with 
TMS320C25 object code. 


This section describes the assembly language instructions for the TMS320C2x 
inicroprocessor. Included in this section are the following major topics: 


® Memory Addressing Modes (Section 4.1 on page 4-2) 
Direct addressing 
Indirect addressing (using eight auxiliary registers) 
Immediate addressing 


° Instruction Set (Section 4.2 on page 4-10) 
Symbols and abbreviations used in the instructions 
instruction set summary (listed according to function) 


@ Individual Instruction Descriptions (Section 4.3 on page 4-17) 
Presented in alphabetical order and providing the following: 
- Assembler syntax 
- Operands 
- Execution 
- Encoding 
- Description 
- Words 
- Cycles 
- Repeatability 
- Example(s) 
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4.1 Memory Addressing Modes 


The TMS320C2x instruction set provides three memory addressing modes: 


e Direct addressing mode 
@ Indirect addressing mode 
e Immediate addressing mode 


Both direct and indirect addressing can be used to access data memory. Direct 
addressing concatenates seven bits of the instruction word with the nine bits 
of the data memory page pointer to form the 16-bit data memory address. 
Indirect addressing accesses data memory through the auxiliary registers. In 
immediate addressing, the data is based on a portion of the instruction 
word(s). The following sections describe each addressing mode and give the 
opcode formats and some examples for each mode. 


4.1.1 Direct Addressing Mode 
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In the direct memory addressing mode, the instruction word contains the 
lower seven bits of the data memory address (dma). This field is concatenated 
with the nine bits of the data memory page pointer (DP) register to form the 
full 16-bit data memory address. Thus, the DP register points to one of 512 
possible 128-word data memory pages, and the 7-bit address in the instruc- 
tion points to the specific location within that data memory page. The DP - 
register is loaded through the LDP (load data memory page pointer), LDPK 
(load data memory page pointer immediate), or LST (load status register STO) 
instructions. 


Note: 


The data page pointer is not initialized by reset and is therefore undefined 
after powerup. The TMS320C2x development tools, however, utilize de- 
fault values for many parameters, including the data page pointer. Because 


of this, programs that do not explicitly initialize the data page pointer may 
execute improperly, depending on whether they are executed on a 
TMS320C2x device or using a development tool. Thus, it is critical that 
all programs initialize the data page pointer in software. 
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Figure 4-1 illustrates how the 16-bit data address is formed. 


7 LSBS FROM 
INSTRUCTION 
REGISTER (IR) 


16-BIT DATA ADDRESS 


Figure 4-1. Direct Addressing Block Diagram 


Direct addressing can be used with all instructions except CALL, the branch 
instructions, immediate operand instructions, and instructions with no oper- 
ands. The direct addressing format is as follows: 


15 1413121110 9 8 7 6 5 4 3 2 1 +0 
| Opcode OT dma 


Bits 15 through 8 contain the opcode. Bit 7 = O defines the addressing mode 
as direct, and bits 6 through O contain the data memory address (dma). 


Example of Direct Addressing Format: 


ADD 9,5 Add to accumulator the contents of data memory location 
9 left-shifted 5 bits. — 


15 1413121110 9 8 7 6 5 4 3 2 1 Q 


000001 01/0/00 001001 


The opcode of the ADD 9,5 instruction is >05 and appears in bits 15 through 
8. The notation >nn indicates nn is a hexadecimal number. The shift count 
of >5 appears in bits 11 through 8 of the opcode. The data memory address 
>0O9 appears in bits 6 through 0. 
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4.1.2 Indirect Addressing Mode 


The auxiliary registers (AR) provide flexible and powerful indirect addressing. 
Five auxiliary registers (ARO-AR4) are provided on the TMS32020, and eight 
auxiliary registers (ARO-AR7) are available on the TMS320C25. To select a 
specific auxiliary register, the Auxiliary Register Pointer (ARP) is loaded with 
a value from O through 4 or 7, designating ARO through AR4 or AR7, respec- 
tively (see Figure 4-2). 


3 AUXILIARY 
REGISTERS 
ARB (3) ARP (3) ARO (16) 


16-BIT DATA ADDRESS 
TtTMS320C25 specific. 


Figure 4-2. Indirect Addressing Block Diagram 


The contents of the auxiliary registers may be operated upon by the Auxiliary 
Register Arithmetic Unit (ARAU), which implements 16-bit unsigned arith- 
metic. The ARAU performs auxiliary register arithmetic operations in the same 
cycle as the execution of the instruction. (Note that the increment or decre- 
ment of the indicated AR is always executed after the use of that AR in the 
instruction.) 


In indirect addressing, any location in the 64K data memory space can be ac- 
cessed via the 16-bit addresses contained in the auxiliary registers. These may 
be loaded by the instructions LAR (load auxiliary register), LARK (load auxil- 
iary register immediate), and LRLK (load auxiliary register long immediate). 
The auxiliary registers on the TMS320C25 may be modified by ADRK (add to 
auxiliary register short immediate) or SBRK (subtract from auxiliary register 
short immediate). The TMS320C2x auxiliary registers may also be modified 
by the MAR (modify auxiliary register) instruction or, equivalently, by the in- 
direct addressing field of any instruction supporting indirect addressing. 
AR(ARP) denotes the auxiliary register selected by ARP | 
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The following symbols are used in indirect addressing, including bit-reversed 
(BR) addressing: 


i Contents of AR(ARP) are used as the data memory address. 


*. Contents of AR(ARP) are used as the data memory address, then 
decremented after the access. 


*+ Contents of AR(ARP) are used as the data memory address, then 
incremented after the access. 


*0- Contents of AR(ARP) are used as the data memory address, and the 
contents of ARO subtracted from it after the access. 


*O+ Contents of AR(ARP) are used as the data memory address, and the 
contents of ARO added to it after the access. 


*BRO- Contents of AR(ARP) are used as the data memory address, and the 
contents of ARO subtracted from it, with reverse carry (rc) propa- 
gation, after the access (TMS320C25 specific). 


*BRO+ Contents of AR(ARP) are used as the data memory address, and the 
contents of ARO added to it, with reverse carry (rc) propagation, af- 
ter the access (TMS320C25 specific). 


There are two main types of indirect addressing with indexing: 


© Regular indirect addressing with increment or decrement, and 


e Indirect addressing with indexing based on the value of ARO: 
Indexing by adding or subtracting the contents of ARO, or 
Indexing by adding or subtracting the contents of ARO with the 
carry propagation reversed (for FFTs on the TMS320C25). 


In either case, the contents of the auxiliary register pointed to by the ARP re- 
gister are used as the address of the data memory operand. Then, the ARAU 
performs the specified mathematical operation on the indicated auxiliary reg- 
ister. Additionally, the ARP may be loaded with a new value. All indexing op- 
erations are performed on the current auxiliary register in the same cycle as the 
original instruction. 


Indirect auxiliary register addressing allows for post-access adjustments of the 
auxiliary register pointed to by the ARP. The adjustment may be an increment 
or decrement by one or based upon the contents of ARO. 


Bit-reversed addressing modes on the TMS320C25 allow efficient I/O to be 
performed for the resequencing of data points in a radix-2 FFT program. The 
direction of carry propagation in the ARAU is reversed when this mode is se- 
lected and ARO ts added to/subtracted from the current auxiliary register. 
Typical use of this addressing mode requires that ARO first be set to a value 
corresponding to one-half of the array size, and AR(ARP) be set to the base 
address of the data (the first data point). See Section 5.7.4 for an FFT example 
using bit-reversed addressing modes. 
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Indirect addressing can be used with all instructions except immediate oper- 
and instructions and instructions with no operands. The indirect addressing 
format is as follows: 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 =90 


Bits 15 through 8 contain the opcode, and bit 7 = 1 defines the addressing 
mode as indirect. Bits 6 through 0 contain the indirect addressing control bits. 


Bit 6 contains the increment/decrement value (IDV). The IDV determines 
whether ARO will be used to increment or decrement the current auxiliary 
register. If bit 6 = 0, an increment or decrement (if any) by one occurs to the 
current auxiliary register. If bit 6 = 1, ARO may be added to or subtracted from 
the current auxiliary register as defined by bits 5 and 4. 


Bits 5 and 4 control the arithmetic operation to be performed with AR(ARP) 
and ARO. When set, bit 5 indicates that an increment is to be performed. If bit 
4 is set, a decrement is to be performed. Table 4-1 shows the correspondence 
of bit pattern and arithmetic operation. 


Table 4-1. Indirect Addressing Arithmetic Operations 


BITS ARITHMETIC OPERATION 
5 


No operation on AR(ARP) 

AR(ARP) - 1 > AR(ARP) 

AR(ARP) + 1 ~ AR(ARP) 

Reserved 

AR(ARP) - ARO > AR(ARP) [reverse carry propagation]T 
AR(ARP) - ARO > AR(ARP) 

AR(ARP) + ARO > AR(ARP) 

AR(ARP) + ARO > AR(ARP) [reverse carry propagation]t 


_ tTMS320C25 specific. 


for) 


=o-0o-o0-0]a 


0 
0 
0 
0 
1 
1 
1 
1 


Bit 3 and bits 2 through 0 control the Auxiliary Register Pointer (ARP). Bit 3 
(NAR) determines if a new value is loaded into the ARP. If bit 3 = 1, the 
contents of bits 2 through O (Y = next ARP) are loaded into the ARP. If bit 
3 = 0, the contents of the ARP remain unchanged. 


Table 4-2 shows the bit fields, notation, and operation used for indirect ad- 
dressing. 


we 


¢ 
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Table 4-2. Bit Fields for Indirect Addressing 


INSTRUCTION FIELD BITS NOTATION OPERATION 
15 - 87654321 0 


AR(ARP)-1 — AR(ARP) 
Y ~ ARP 


AR(ARP)+1 — AR(ARP) 


AR(ARP)+1 — AR(ARP) 
Y > ARP 


“Opcode> 110007 > AR(ARP)-rcARO > AR(ARP)t 


“Qpcode> 11001 * Y >| “*BRO-,Y AR(ARP)-rcARO — AR(ARP) 
Y ~ ARPT 


“ Opcode> 1 1 0 1 


AR(ARP)+ARO —~ AR(ARP) 


o1°¢Y> AR(ARP)+ARO > AR(ARP) 


Y > ARP 
“Opcode> 1 1110°Y > AR(ARP)+rcARO > AR(ARP)t 


“Opcode> 1111 1° Y >] *BRO+,Y | AR(ARP)+rcARO ~ AR(ARP) 
Y > ARPT 


TBR = bit-reversed addressing mode and rc = reverse carry propagation (TMS320C25). 


For some instructions, the notation in Table 4-2 includes a shift code, e.g., 
*0+,8,3 where 8 is the shift code and Y = 3. 


The CMPR (compare auxiliary register with ARO), and BBZ/BBNZ (branch if 
TC bit equal/not equal to zero) instructions facilitate conditional branches 
based on comparisons between the contents of ARO and the contents of 
AR(ARP). 


The auxiliary registers may also be used for temporary storage via the load and 
store auxiliary register instructions, LAR and SAR, respectively. 


The following examples illustrate the indirect addressing format: 
Example 1: | 


ADD *+,8 Add to the accumulator the contents of the data memory 
address defined by the contents of the current auxiliary 
register. This data is left-shifted 8 bits before being 
added. The current auxiliary register is autoincremented 
by one. The opcode is >O8A0, as shown below. 


16 .14-°73 127% 10:9. 8 oF 26, 6 AS? TD 


6.0 0-0 1 0 0. Of 4) 0 1 0-00 0: 6 
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Example 2: 
ADD *,8 


Example 3: 
ADD *-,8 


Example 4: 


ADD *0+,8 


Example 5: 
ADD ‘0-,8 


Example 6: 


ADD *+,8,3 


Example 7: 


ADD “*BRO-,8 


Example 8: 


ADD *BRO+,8 


As in Example 1, but with no autoincrement; the opcode 
is >Q880. 


As in Example 1, except that the current auxiliary register 
is decremented by one; the opcode is >0890. 


. As in Example 1, except that the contents of auxiliary 


register ARO are added to the current auxiliary register; 
the opcode is >O8E0. 


As in Example 1, except that the contents of auxiliary 
register ARO are subtracted from the current auxiliary re- 
gister; the opcode is >O8DO. 


As in Example 1, except that the auxiliary register pointer 
(ARP) is loaded with the value 3 for subsequent in- 
structions; the opcode is >O8AB. 


The opcode is >08CO. The contents of auxiliary register 
ARO are subtracted from the current auxiliary register with 
reverse carry propagation (TMS320C25). 


The opcode is >QO8FO. The contents of auxiliary register 
ARO are added to the current auxiliary register with re- 
verse Carry propagation (TMS320C25). 
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4.1.3 Immediate Addressing Mode 


In immediate addressing, the instruction word(s) contains tlic value of the 
immediate operand. The TMS320C2x has both single-word (8-b% and 13-bit 
constant) short immediate instructions and two-word (16-bit constant) long 
immediate instructions. The immediate operand is contained within the in- 
struction word itself in short immediate instructions. In long immediate in- 
structions, the word following the instruction opcode is used as the immediate 
operand. 


The following short immediate instructions contain the immediate operand in 
the instruction word and execute within a single instruction cycle. The length 
of the constant operand is instruction-dependent. Note that the ADDK, ADRK, 
SBRK, and SUBK instructions are available on the TMS320C25. 


ADDK Add to accumulator short immediate (8-bit absolute constant) 

ADRK Add to auxiliary register short immediate (8-bit absolute con- 
stant) 

LACK Load accumulator short immediate (8-bit absolute constant) 

LARK Load auxiliary register sort immediate (8-bit absolute constant) 

LARP Load auxiliary register pointer (3-bit constant) 

LDPK Load data memory page pointer immediate (9-bit constant) 

MPYK Multiply immediate (13-bit two’s-complement constant) 

IPTK Repeat instruction as specified by immediate value (8-bit con- 
stant) 

SBRK Subtract from auxiliary register short immediate (8-bit absolute 
constant) 

SUBK Subtract from accumulator short immediate (8-bit absolute 
constant). 


Example of short immediate addressing format: 
RPTK 99 Execute the instruction following this instruction 100 times. 


With the RPTK instruction, the immediate operand is contained as a part of the 
instruction opcode. The instruction format for RPTK is as follows: 


15 1413121110 9 8 7 6 5 4 3 2 1 =O 


110 0 10 1 14 8-BitConstant_ 
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For long immediate instructions, the constant is a 16-bit value in the word 
following the opcode. The 16-bit value can be optionally used as an absolute 
constant or as a two’s-complement value. 


ADLK Add to accumulator long immediate with shift (absolute or two’s 
complement) 

ANDK AND immediate with accumulator with shift 

LALK Load accumulator long immediate with shift (absolute or two’s 
complement) 

LRLK Load auxiliary register long immediate 

ORK OR immediate with accumulator with shift 

SBLK Subtract from accumulator long immediate with shift (absolute 


or two’s complement) 


XORK Exclusive-OR immediate with accumulator with shift. 


Example of long immediate addressing format: 


ADLK 16384,2 Add to the accumulator the value 16384 with a shift to 
the left of two, effectively adding 65536 to the contents 
of the accumulator. 


The ADLK instruction uses the word following the instruction opcode as the 


immediate operand. The instruction format for ADLK is as follows: 


15 14131211109 8 765 43 21 0 


1 1°01] Shift [0 000001 0 
16-Bit Constant 


4.2 Instruction Set. 


The following sections list the symbols and abbreviations used in the instruc- 
tion set summary and in the instruction descriptions. The complete instruction 
set summary is organized according to function. A detailed description of each 
instruction is listed in the instruction set summary. 


4.2.1 Symbols and Abbreviations 


Table 4-3 lists symbols and abbreviations used in the instruction set summary 
(Table 4-4) and the individual instruction descriptions. 
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Table 4-3. Instruction Symbols 


Port address 

Accumulator 

Auxiliary register pointer buffer 

Auxiliary register n (ARO, AR1 assembler symbols equal to 0 or 1) 
Auxiliary register pointer 
4-bit field specifying a bit code 

Branch control input 

Carry bit 

2-bit field specifying compare mode 

On-chip RAM configuration control bit 

Data memory address field 

Label assigned to data memory location n 

Data memory address 

Data page pointer 

Format status bit 

Frame synchronization mode bit 

Hold mode bit 

Addressing mode bit 

Interrupt mode flag bit 

Immediate operand field 

Microcall stack 

nn = hexadecimal number (others are decimal values) 
Overflow mode flag bit 

Overflow mode bit 

Product register 

Port address (PAO-PA15 assembler symbols equal to 0 through 15) 
Program counter 
Prefetch counter 

2-bit field specifying P register output shift code 
Program memory address 

Label assigned to program memory location n 
3-bit operand field specifying auxiliary register 
Repeat counter 

4-bit left-shift code 

Status register n (STO or ST1) 

Sign-extension mode bit 

Temporary register 

Test control bit 

Top of stack 

Transmit mode bit 

3-bit accumulator left-shift field 

XF pin status bit 

Is assigned to 

An absolute value 

User-defined items 

Optional items 

Contents of 

Alternative items, one of which must be entered 
Blanks or spaces must be entered where shown. 
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4.2.2 Instruction Set Summary 


Table 4-4 shows the instruction set summary for the TMS320C25 processor, 
which is a superset of the TMS320C1x and TMS320C2x instruction sets. 
Included in the instruction set are four special groups of instructions. to im- 
prove overall processor throughput and ease of use. 


@ Extended-precision arithmetic (ADDC, SUBB, MPYU, BC, BNC, SC, 
and RC) 


@ Adaptive filtering (MPYA, MPYS, and ZALR) 
e Control and I/O (RHM, SHM, RTC, STC, RFSM, and SFSM) 


rd Accumulator and register (SPH, SPL, ADDK, SUBK, ADRK, SBRK, ROL, 
and ROR). 


The instruction set summary is arranged according to function and alphabet 
ized within each functional grouping. Additional information is presented in 
the individual instruction descriptions in the following section. The symbol tf 
indicates instructions that are specific to the TMS320C2x instruction set. The 
symbol + indicates instructions that are specific to the TMS320C25 instruction 
set. 
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Table 4-4. Instruction Set Summary 


ACCUMULATOR MEMORY REFERENCE INSTRUCTIONS 


Mnemonic and Description Words 16-Bit Opcode 
MSB LSB 


Absolute value of accumulator 0001 1011 
Add to accumulator with shift |! DDD DDDD 
Add to accumulator with carry | DDD DODD 
Add to high accumulator | DDD DDDD 
Add to accumulator short immediate KKKK KKKK 
Add to low accumulator with sign-extension 1 DDD DDDD 
suppressed 

Add to accumulator with shift specified 

by T register 

Add to accumulator long immediate 

with shift 

AND with accumulator 

AND immediate with accumulator with shift 
Complement accumulator 

Load accumulator with shift 

Load accumulator short immediate 

Load accumulator with shift specified 

by T register 

Load accumulator long immediate 

with shift 

Negate accumulator. 

Normalize contents of accumulator 

OR with accumulator 

OR immediate with accumulator with shift 
Rotate accumulator left 

Rotate accumulator right 

Store high accumulator with shift 

Store low accumulator with shift 

Subtract from accumulator long immediate 
with shift 

Shift accumulator left 

Shift accumulator right 

Subtract from accumulator with shift 
Ssubtract from accumulator with borrow 
Conditional subtract 

Subtract from high accumulator 

Ssubtract from accumulator short immediate 
Subtract from low accumulator with 

sign extension suppressed 

Subtract from accumulator with shift specified 
by T register 

Exclusive-OR with accumulator | DDD DDDD 
Exclusive-OR immediate with accumulator 0000 0110 
with shift 

Zero accumulator 0000 0000 
Zero {ow accumulator and load high accumulator | DDD DDDD 
Zero low accumulator and load high accumulator ! DDD DDDD 
with rounding 

Zero accumulator and load low accumulator | DDD DDDD 
with sign extension suppressed 


TThese instructions are specific to the TMS320C2x instruction set. 
tThese instructions are specific to the TMS320C25 instruction set. 


| DDD DDDD 


np — 


0000 0010 


| DDD DDDD 
0000 0100 
0010 0111 
|.DDD DDDD 
KKKK KKKK 
1! DDD DDDD 


mS em oppo— 


No 


0000 0001 


o 
oO 


0010 0011 
1010 0010 
1 DDD DDDD 
0000 0101 
0011 0100 
0011 0101 
1 DDD DDDD 
| DDD DDDD 
0000 0011 


OPP AB apo - =a 
~"~O0O---]0- == 
_ = = 2 = — - = 
-Cooo0o-Co°o 

More - nN =a 
AXxKXA-HnO-- 


0001 1000 
0001 1001 
| DDD DDDD 
{DDD DDDD 
1 DDD DDDD 
| DDD DDDD 
KKKK KKKK 
1 DDD DDDD 


oo o7rr/-00000 


oo 


— 2 eS S| Ye 


—_— — 


o 
o-oo 
(fo) 
_ 


oo 
kb 
oo 
oo 
-“=O- "NOOO NXKOON~-OCO 
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oO 
_ 
oO 
—_ 


| DDD DDDD 
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Table 4-4. Instruction Set Summary (Continued) 


AUXILIARY REGISTERS AND DATA PAGE POINTER INSTRUCTIONS 


Mnemonic and Description 16-Bit Opcode 


MSB LSB 
Add to auxiliary register short immediate KKKK 
Compare auxiliary register with auxiliary 0101 
register ARO 
LAR Load auxiliary register 
LARK Load auxiliary register short immediate 
LARP Load auxiliary register pointer 
LDP Load data memory page pointer 
LDPK Load data memory page pointer immediate 
LRLKTt Load auxiliary register long immediate 
MAR Modify auxiliary register 
SAR Store auxiliary register ORRR | DDD 
SBRKt Subtract from auxiliary register short immediate 1111 KKKK 


T REGISTER, P REGISTER, AND MULTIPLY INSTRUCTIONS 


Mnemonic and Description 16-Bit Opcode 
MSB LSB 


0011 i DDD 
1100 ORRR KKKK KKKK 
0101 0101 1000 1RRR 
0101 0010 | DDD DDDD 
1100 100K KKKK KKKK 
1101 
0101 
0111 
0111 


ORRR 0000 0000 
0101 ! DDD DDDD 


a ee Ge ee ee ee | § 


Add P register to accumulator 1110 0001 0101 
LPHt Load high P register 0101 0011 | DDD DDDD 
LT Load T register 0011 1100 i DDD DDDD 
LTA Load T register and accumulate previous product 0011 1101 | DDD DDDD 
LTD Load T register, accumulate previous product, 0011 1111 | DOD ODDD 


and move data 

LTPt Load T register and store P register in 
accumulator 

LTSt Load T register and subtract previous 
product 


_ 


0011 1110 1! DDD DDDD 


ob 


0101 1011 | DDD DDDD 


MAcCt Multiply and accumulate 2 0101 1101 | DDD DDDD 
MACDt Multiply and accumulate with data move 2 0101 1100 | DDD DDDD 
MPY Multiply (with T register, store product in 1 0011 1000 | DDD DDDD 
P register) 
MPYA? Multiply and accumulate previous product 1 0011 1010 | DDD DDDD 
MPYK Multiply immediate 1 101K KKKK KKKK KKKK 
MPYS+ Multiply and subtract previous product 1 0011 1011 | DDD DDDD 
MPYUt Multiply unsigned 1 1100 1111 ! DDD DDDD 
PAC Load accumulator with P register 1 1100 1110 0001 0100 
SPAC Subtract P register from accumulator 1 1100 1110 0001 0110 
SPHt Store high P register 1 0111 1101 | DODD DDDD 
SPLt Store low P register 1 0111 1100 | DDD DDDD 
SPMt Set P register output shift mode 1 1100 1110 0000 10KK 
SQRATt Square and accumulate 1 0011 1001 i DDD DDDD 
SQRSt Square and subtract previous product 1 0101 1010 | DDD DDDD 


tThese instructions are specific to the TMS320C2x instruction set. 
tThese instructions are specific to the TMS320C25 instruction set. 
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Table 4-4. Instruction Set Summary (Continued) 


BRANCH/CALL INSTRUCTIONS 


Mnemonic and Description Words 16-Bit Opcode 
MSB LSB 


B Branch unconditionally 1DDD DDDD 
Branch to address specified by accumulator 0010 0101 
Branch on auxiliary register not zero 1DDD ODDD 
Branch if TC bit # O 1DDD DDDD 
Branch if TC bit = 0 1DDD DDDD 
Branch on carry 1DDD DDDD 
Branch if accumulator > 0 1DDD DDDD 
Branch if accumulator > 0 .1DDD DDDD 
Branch on 1/O status = 0 1DDD DDDD 
Branch if accumulator < C 1DDD DDDD 
Branch if accumulator < 0 1DDD DDDD 
Branch on no carry 1DDD DDDD 
Branch if no overflow 1DDD DDDD 
Branch if accumulator # 0 1DDD DDDD 
Branch on overflow 1DDD DDDD 
Branch if accumulator = 0 1DDD DDDD 
Call subroutine indirect 0010 0100 
Call subroutine 1DDD DDDD 
0010 0110 
0001 1110 


BANK NMONNNMNNNNNMNNMNNN bh 


Return from subroutine 
Software interrupt 1 


1/0 AND DATA MEMORY OPERATIONS 


Mnemonic and Description 16-Bit Opcode 
MSB 


ee th ee we ee ae et ath et oe ow a os) 
oOooooorfH-f]-0070007-0- 


1 DDD DDDD 
1 DDD DDDD 


Block move from data memory to data memory 
Block move from program memory to data 
memory 

Data move in data memory 

Format serial port registers 

Input data from port 

Output data to port 

Reset serial port frame synchronization mode 
Reset seria! port transmit mode : 

Reset external flag 

Set serial port frame synchronization mode 
Set serial port transmit mode 

Set external flag 

Table read 

Table write 


NN 
==. 
= otk 
ak od 
—_ 


| DDD DDDD 
0000 111K 
1 DDD DDDD 
1 DDD DDDD 
0011 0110 
0010 0000 
0000 1100 
0011 0111 

0010 0001 

0000 1101 

1 DDD DDDD 
1 DDD DDDD 


ABA ewer oran 
moore rrr noo - 
oooooo0oo-0o0°o 
~Aa~oocooocoeo°o°oo=}- 
awa ssnmapp-o 


tThese instructions are specific to the TMS320C2x instruction set. 
¥These instructions are specific to the TMS320C25 instruction set. 
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Table 4-4. Instruction Set Summary (Concluded) 


CONTROL INSTRUCTIONS 


Mnemonic and Description 16-Bit Opcode 
MSB LSB 


Test bit 

Test bit specified by T register 
Configure block as data memory 
Configure block as program memory 
Disable interrupt 

Enable interrupt 

Idle until interrupt 

Load status register STO 

Load status register ST1 

No operation 

Pop top of stack to low accumulator 
Pop top of stack to data memory 
Push data memory value onto stack 
Push low accumulator onto stack 
Reset carry bit 

Reset hold mode 

Reset overflow mode 

Repeat instruction as specified by data memory 
value 

Repeat instruction as specified by immediate 
value 

Reset sign-extension mode 

Reset test/control flag 

Set carry bit 

Set hold mode 

Set overflow mode 

Store status register STO 

Store status register ST1 

Set sign-extension mode 

Set test/control flag 


eek etd et od ot es et od od ot os oo od 8 


om 


Saar mrss a 
[22 OO Ap aaa 
pa ee ee Ce eee er ere 
oor -o0o0oc0o00 
oonr-o0o0°0°o 
Bw awmoon nana 
=3900O7 23-36 
=|==390OAamraa 
oo-ooo0o00o 


tThese instructions are specific to the TMS320C2x instruction set. 
tThese instructions are specific to the TMS320C25 instruction set. 
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4.3 Individual Instruction Descriptions 


Each instruction in the instruction set summary is described in the following 
pages. Instructions are listed in alphabetical order. Information, such as as- 
sembler syntax, operands, operation, encoding, description, words, cycles, and 
examples, is provided for each instruction. An example instruction is provided 
to familiarize the user with the special format used and explain its content. 
Refer to Section 4.1 for further information on memory addressing. Code ex- 
amples using many of the instructions are given in Section 5 on Software 


Applications. 


EXAMPLE Example Instruction EXAMPLE 


Syntax 


Direct: 
indirect: 
Immediate: 


Operands 


Execution 


Encoding 


Direct: 


Indirect: 


Immediate: 


[<label>] EXAMPLE <dma>[,<shift>] 
[<label>] EXAMPLE {ind}[,<shift>[,<next ARP>]] 
[<label>] EXAMPLE [<constant>] 


Each instruction begins with an assembler syntax expression. The optional 
comment field that concludes the syntax is not included in the syntax ex- 
pression. Space(s) are required between each field (label, command, op- 
erand, and comment fields) as shown in the syntax. The syntax example 
illustrates both direct and indirect addressing, as well as immediate ad- 
dressing in which the operand field includes <constant>. 


The indirect addressing operand options, including bit-reversed (BR) ad- 
dressing, are as follows: 


TMS32020: {*|*¥ +|*-|"O+]*0-} 
TMS320C25: {*]"+|*-]"0+]"0-]" BRO+|"BRO-} 
OQ < dma < 127 


O < next ARP < 7 
O < constant < 255 


Operands may be constants or assembly-time expressions referring to 
memory, I/O and register addresses, pointers, shift counts, and a variety of 
constants. The operand values used in the example syntax are shown. Note 
that the next ARP on the TMS32020 is < 4 for auxiliary registers ARO-AR4. 


(PC) +17 PC 
(ACC) + [(dma) x 2shift] + Acc 
If SXM = 1: 

Then (dma) is sign-extended. 


If SXM = 
Then (dma) is not sign-extended. 


Affects OV; affected by OVM and SXM. 
Affects C (TMS320C25). 


An example of the instruction operation sequence is provided, describing 
the processing that takes place when the instruction is executed. Condi- 
tional effects of status register specified modes are also given. Those bits 
in the TMS320C2x status registers affected by the instruction are also 
listed. 


15 14 13 12 1110 9 8 7 6 5 4 3 2 14 =O 


0 0 0 0 Data Memory Address 


1 0 0 13-Bit Constant 


Opcode examples are shown of both direct and indirect addressing or of the 
use of an immediate operand. 


EXAMPLE Example Instruction EXAMPLE 


Description 


Words 


Cycles 
‘20 
'C25 


‘20 
"C25 


Instruction execution and its effect on the rest of the processor or memory 
contents are described. Any constraints on the operands imposed by the 
processor or the assembler are discussed. The description parallels and 
supplements the information given by the execution block. 


1 


The digit specifies the number of memory words required to store the in- 
struction and its extension words. 


as | ee ee 
oo ee ee 


Cycle Timings for a Repeat Execution 
a ae ee ee See ee ee 
fee SP ate Ne ive pe tie: os [Eons 


The table shows the number of cycles required for a given TMS320C2x in- 
struction to execute in a given memory configuration when executed as a 
single instruction or in the repeat mode. The column headings in the tables 
indicate the program source location (PI, PE, or PR) and data destination 
or source (DI or DE), defined as follows: 


PI The instruction executes from internal program memory (RAM). 
PR __ The instruction executes from internal program memory (ROM). 
PE The instruction executes from external program memory. 

DI The instruction executes using internal data memory. 

DE The instruction executes using external data memory. 


The number of cycles required for each instruction is given in terms of the 
program/data memory and I/O access times as defined in the following 
listing: 


p Program memory wait states. Represents the number of clock cycles 
the device waits for external program memory to respond to an ac- 
cess. Tac is the access time, in nanoseconds, (maximum) required 
by the TMS320C2x for an external memory access to be made with 
no wait states. Tmem is the memory device access time, and T, is the 
clock period (4/crystal frequency). 


p=0; If Tmem S$ Tac 
p= 1; If Tac <Tmem S (Tp + Tac) 
p = 2; If (Tp + Tac)< T mem S$ (Tp x 2 + Tac) 
p=k; If [Tp x (k-1) + Tac]< T mem (Tp x kK + Tae) 
d Data memory wait states. Represents the number of cycles the de- 


vice must wait for external data memory to respond to an access. 
This number is calculated in the same way as the p number. 


i 1/O memory wait states. Represents the number of cycles the device 
must wait for external 1/O memory to respond to an access. This 
number is calculated in the same way as the p number. 


EXAMPLE Example Instruction EXAMPLE 


Example 
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Other abbreviations used in the tables and their meanings are as follows: 


br Branch from ... 

int Internal program memory. 

INT Interrupt. 

ext External program memory. 

n The number of times an instruction is executea when using the RPT 
or RPTK instruction. , 


Refer to Appendix D for further information on instruction cycle classifica- 
tions and timings 


ADD DAT1,3 (DP = 10) 
or 


ADD *,3 If current auxiliary register contains 1281. 
Before Instruction Afier Instruction 
Data Data 
1281 1281 


nce RS] ace We] 
C Cc 


The sample code presented in the above format shows the effect of the 
code on memory and/or registers. The use of the carry bit (C) provided on 
the TMS320C25 is shown in the small box. 


ABS 


Syntax 
Operands 


Execution 


Encoding 


Description 


Words 
Cycles 


Example 


‘20 
‘C25 


‘20 
‘C25 


Absolute Value of Accumulator ABS 


[<label>] ABS 


None 
(PC) +17 PC 
{(ACC)| ~ ACC 


Affects OV; affected by OVM. 
Affects C (TMS320C25). 
Not affected by SXM. 


15 14 13 12 1110 9 8 7 6 5 4 3 2 1 90 
1 1 0 0 1 1 1 0 0 0 0 1 1 0 1 1 


If the contents of the accumulator are greater than or equal to zero, the ac- 
cumulator is unchanged by the execution of ABS. If the contents of the 
accumulator are less than zero, the accumulator is replaced by its two’s- 
complement value. 


Note that >80000000 is a special case. When the overflow mode is not set, 
the ABS of >80000000 is >80000000. When in the overflow mode, the 
ABS of >80000000 is >7FFFFFFF. In either case, the OV status bit is set. 
The carry bit (C) on the TMS320C25 is always reset to zero by the exe- 
cution of this instruction. 


1 


Ne 2 2 ee ae 
Sane a Re ee ee a 
ee ey a ee Le ee eee 
ecco aoa ee eR SI Ws 


ABS 


Before Instruction After Instruction 


ace ace [[___>1738 | 
C C 

ace ace [| ] 
C C 
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ADD Add to Accumulator with Shift ADD 


Syntax 
Direct: [<label>] ADD <dma>[,<shift>] 
Indirect: [<label>] ADD {ind}[,<shift>[,<next ARP>]] 


Operands 0 < dma s 127 
O < next ARP < 7 
0 < shift < 15 (defaults to-0) 


Execution (PO) + 1 PC ‘ 
(ACC) + [(dma) x 2shift] + acc 


lf SXM = 1: 
Then (dma) is sign-extended. 
lf SXM = 0: 


Then (dma) is not sign-extended. 
Affects OV; affected by OVM and SXM. 
Affects C (TMS320C25). 


Encoding 15 14 13 12 1110 9 8 7 6 5 0 


4 3 2 1 
Direct: | 0 0 0 0 | Shift | Data Memory Address 
Indirect:||0 0 0 of shit | 1] See Section 4.1 


Description The contents of the addressed data memory location are left-shifted and 
added to the accumulator. During shifting, low-order bits are zero-filled. 
High-order bits are sign-extended if SXM = 1 and zero-filled if SXM = O. 
The result is stored in the accumulator. 


Words 1 


Cycles 
Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/DI PE/DE PR/DI PR/DE — 
20,05 We rd ee Pet Se 
cap} tT td tte | tdtp [ted 
Cycle Timings for a Repeat Execution 
20, on | 2ntnd inte | 2ntndtp | - - 
‘c25 | in| ttntnd | ontp | ttnendtp| on | t+ntnd_| 
Example ADD ‘DAT1,3 (DP = 10) 
or 
ADD *,3 If current auxiliary register contains 1281. 
Before Instruction After Instruction 
Data Data 


1281 1281 


>8 
ace ace [a] 
| Cc Cc 


Add to Accumulator 
ADDC with Carry (TMS320C25 ADDC 


Syntax 
Direct: [<label>] ADDC <dma> 
Indirect: [<label>] ADDC {ind}[,<next ARP>] 


Operands O < dma <s 127 
O < next ARP < 7 


Execution (PC) +17 PC 
(ACC) + (dma) + (C) > ACC 


Affects OV and C; affected by OVM. 
Encoding 15 14 13 12 11 10 9 8 7 6 5 4 3 2 14 =O 


Direct:} O 1 0 0 0 0 1 1 re Data Memory Address 
6-4-0. "O OO 4) 4 th See Section 4.1. 


Description The contents of the addressed data memory location and the value of the 
carry bit are added to the accumulator. The carry bit is then affected in the 
normal manner. 


Indirect: 


The ADDC instruction can be used in performing multiple: precision arith- 


metic. 
Words 1 
Cycles 
Cycle Timings for a Single Instruction 
PI/DI! PI/DE. PE/DI PE/DE PR/DI PR/DE 
2+d 2+d 
Example 1 ADDC DATS5 (DP = 8) 
or 
ADDC * If current auxiliary register contains 1029. 
Before Instruction . After Instruction 
Data Data 
- 4029 10 


ace ff] 13 ] imme 
c . Cc 


Add to Accumulator 


ADDC with Carry (TMS320C25 ADDC 
Example 2 ADDC DATS (DP = 8) 
or 
ADDC * If current auxiliary register contains 1029. 
Before Instruction After Instruction 
Data Data 
1029 1029 


Cc C 
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ADDH Add to High Accumulator ADDH 


Syntax 


Direct: [<label>] ADDH <dma> 
Indirect: [<label>] ADDH {ind}[,<next ARP>] 


Operand 


Ss 


Execution 


Encoding 


Indirect: 


Descript 


Words 
Cycles 


Example 


Direct: 


ion 


‘20 
"C25 


‘20 
"C25 


O < dma < 127 
O < next ARP < 7 


(PC) +17 PC 


(ACC) + [(dma) x 216] > ACC 


Affects OV; affected by OVM. 
Affects C (TMS320C25). 
Low-order bits of the ACC not affected. 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 141 #0 


oe ee ee ee ee Data Memory Address 


0 See Section 4.1 


The contents of the addressed data memory location are added to the upper 
half of the accumulator (bits 31 through 16). Low-order bits are unaffected 
by ADDH. The carry bit (C) on the TMS320C25 is set if the result of the 
addition generates a carry; otherwise, C is unaffected. The carry bit can 
only be set, not reset, by the ADDH instruction. 


The ADDH instruction may be used in performing 32-bit arithmetic. 
1 


[1 | 2a | t+p | 2a | - [| - 
A 
Tn | anend_[ np | 2nendep | - [- 
Pn tenend [np [tenendtp | nnd 


ADDH DATS (DP = 8) 


or 
ADDH * If current auxiliary register contains 1029. 
Before Instruction After Instruction 
Data Data 
1029 1029 


Cc C 
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Add to Accumulator 
ADDK Short Immediate (TMS320C25 ADDK 


Syntax [<label>] ADDK <constant> 
Operands 0 < constant < 255 


Execution (PC) +1 > PC 
(ACC) + 8-bit positive constant ~ ACC 


Affects OVM and C; affected by OVM. 
Not affected by SXM. 


Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 1 =O 


Description The 8-bit immediate value is added, right-justified, to the accumulator with 
the result replacing the accumulator contents. The immediate value is 
treated as an 8-bit positive number, regardless of the value of SXM. 


Words 1 
Cycles 


Cycle Timings for a Single Instruction 
PI/DI Pi/DE PE/DI PE/DE PR/DI PR/DE 
a ay a a ee ee ee ee 


Cycle Timings for a Repeat Execution 


Example ADDK >5 


Before Instruction After Instruction 


ace fx][_=7ee2e1] ace [o][_+ 708268 | 
Cc . Cc 
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Add to Accumulator 


ADDS with Sign-Extension Suppressed ADDS 
Syntax 


Direct: [<label>] ADDS <dma> 
Indirect: [<label>] ADDS {ind}[,<next ARP>] 


Operand 


$s 


Execution 


Encoding 


Indirect: 


Descript 


Words 
Cycles 


Example 


Direct: 


ion 


‘20 
‘C25 


‘20 
"C25 


O < dma < 127 
O < next ARP < 7 


(PCy 2:1 PC 
(ACC) + (dma) ~ ACC 
(dma) is a 16-bit unsigned number. 


Affects OV; affected by OVM. 
Affects C (TMS320C25). 
Not affected by SXM. 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 =#0 


0 1 0 0) 1 (0) (0) 1 Data Memory Address 
Got 6.0. 4. O60 ta) See Section 4.1 


The contents of the specified data memory location are added with sign- 
extension suppressed. The data is treated as a 16-bit unsigned number, 
regardless of SXM. The accumulator behaves as a signed number. Note 
that ADDS produces the same results as an ADD instruction with SXM = 
0 and a shift count of 0. 


1 


PD 
[1 _| 2d | t+p | aeiep | - | - 
SR rc 

= 


Cycle Timings for a Repeat Execution 


2nd 


mare) pn tenon 


ADDS DATI1 (DP = 6) 
or 
ADDS * If current auxiliary register contains 779. 
Before Instruction After Instruction 
Data Data 
Memory >FO06 Memory >FO06 
779 779 


ace 3] ace (oa 
Cc C 
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Add to Accumulator 


ADDT with Shift Specified by T Register ADDT 
Syntax 


Direct: [<label>] ADDT <dma> 
Indirect: [<label>] ADDT {ind}[,<next ARP>] 


Operands 0 < dma < 127 
O < next ARP < 7 


Execution (PC)-+ } + PC ; 
(ACC) + [(dma) x oT register(3-0)) > (ACC) 


lf SXM = 1: 
Then (dma) is sign-extended. 
lf SXM = 0: 


Then (dma) is not sign-extended. 


Affects OV; affected by SXM and OVM. 
Affects C (TMS320C25). 


Encoding 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 90 


Direct: | 0 1 0 = #6«=O0 1 0 1 0 LQ. | Data Memory Address 
Indirect: 0 See Section 4.1 


Description The data memory value is left-shifted and added to the accumulator, with 
the result replacing the accumulator contents. The left-shift is defined by 
the four LSBs of the T register, resulting in shift options from O to 15 bits. 
Sign extension on the data memory value is controlled by SXM. 


Words 1 


Cycles 

a ees ee eee ee eee ee, 

Pt | 2d | tp | 2d | 1 | 24d | 
Cycle Timings for a Repeat Execution 

Pn] 2nend | ntp | anendtp | - |= id 

pn | tenend | tp [ttnendtp| | ttnend 


Example ADDT DAT127 (DP = 4) 


or 
ADDT * If current auxiliary register contains 639. 


‘20 
'C25 


‘20 
‘C25 
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Add to Accumulator 


ADDT with Shift Specified by T Register ADDT 


Before Instruction After Instruction 


Data Data 
Meson Haney 
639 63 
T >FFO4 T >FFO4 
ACC > FT15 acc [ol >F7A5 
Cc C 
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Add to Accumulator 


ADLK _ Long Immediate with Shift ADLK 
Syntax [<label>] ADLK <constant>[,<shift>] 
Operands 16-bit constant 


O < shift < 15 (defaults to 0) 


Execution (PC) +27 PC ; 
(ACC) + [constant x 2shift] + acc 


If SXM = 1: 
Then -32768 < constant < 32767. 
lf SXM = 0: 


Then O < constant < 65535. 


Affects OV; affected by OVM and SXM. 
Affects C (TMS320C25). 


Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 1 #0 


110 i] shim [oo 0 0 0 0 10 
16-bit Constant 


Description The 16-bit immediate value, left-shifted as specified, is added to the accu- 
mulator. The result replaces the accumulator contents. SXM determines 
whether the constant is treated as a signed two’s-complement number or 
as an unsigned number. The shift count is optional and defaults to zero. 


Words 2 


Cycles 
ro{ 2 | 2 | 2+ | 2 [| - | - | 
"C25 
Cycle Timings for a Repeat Execution 
20{ notrepeatable | CCT 
C25 
Example ADLK 5,8 


Before Instruction - After Instruction 


nce BJL__=108F} ace [O[_>15F | 
Cc Cc 


Add to Auxiliary Register 


ADRK Short Immediate (TMS320C25 ADRK 
Syntax [<label>] ADRK <constant> 
Operands O < constant < 255 


Execution (PO) ei PC 
AR(ARP) + 8-bit positive constant ~ AR(ARP) 


Encoding 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 #0 


Description The 8-bit immediate value is added, right-justified, to the currently selected 
auxiliary register with the result replacing the auxiliary register contents. 
The addition takes place in the ARAU, with the immediate value treated as 
an 8-bit positive integer. 


Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/DI PE/DE PR/D! PR/DE 


Cycle Timings for a Repeat Execution 


Example ADRK >80 (ARP = 5) 


Before Instruction After Instruction 


AR5 > 4321 AR5 >43A1 
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AND AND with Accumulator AND 


Syntax 
Direct: [<label>] AND <dma> 
Indirect: [<label>] AND {ind}[,<next ARP>] 


Operands 0 s dma s 127 
O < next ARP <7 


Execution (PC) +17 PC 
(ACC(15-0)).AND.(dma) ~ ACC(15-0) 
0 > ACC(31-16) 
Not affected by SXM. 


Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 1 =O 
0 


Direct:| O 1 Oo oO 1 1 1 Data Memory Address 
Indirect} 0 1 0 0 1 1 +14 «Of1 | See Section 4.1 


Description The lower half of the accumulator is ANDed with the contents of the ad- 
dressed data memory location. The upper half of the accumulator is ANDed 
with all zeroes. Therefore, the upper half of the accumulator is always ze- 
roed by the AND instruction. 


Words 1 
Cycles 


Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/DI PE/DE PR/DI PR/DE 


20[__1 | 2d | tp | 24s | - | - | 
cos[_1 | 2+d | tp | 2+drp | 1 [| 2rd | 
Cycle Timings for a Repeat Execution 


20[-_» | 2nend[_ntp | 2ntndtp |---| 
cas [__n | tentnd_| _otp | tentndtp| a | t+nend 
Example AND DAT16 (DP = 4) 
or 
AND bs If current auxiliary register contains 528. 
Before Instruction After Instruction 


Data - Data 

Memory >FF Memory [FF | 

528 528 

ACC >12345678 ACC >00000078 | 
Cc Cc 


ANDK 


Syntax 
Operands 


Execution 


Encoding 


Description 


Words 
Cycles 
‘20 
‘C25 
‘20 
"C25 


Example 


AND Immediate 
with Accumulator with Shift ANDK 


{<label>] ANDK <constant> [,<shift>] 


16-bit constant 
O < shift < 15 (defaults to 0) 


(PC) +2 > PC ; 
(ACC(30-0)).AND.[(constant x 2Shifty] + ACC(30-0) 
0 ~ ACC(31) and all other bit positions unoccupied by shifted constant. 


Not affected by SXM. 
15 14 13.12 1110 9 8 7 6 5 4 3 2 1 «0 


00 0 0 0 1 0 0 
16-bit Constant 


The 16-bit immediate constant is left-shifted as specified and ANDed with 
the accumulator. The result is left in the accumulator. Low-order bits below 
and high-order bits above the shifted value are treated as zeroes, clearing 
the corresponding bits in the accumulator. Note that the accumulator’s 
most-significant bit is always zeroed regardless of the shift-code value. 


No 


[2 [2 | 2a [ 2p | - [| - 
Eee ee | aap | eee te 
[___ notrepestabie dT SC Cd CC 


ANDK >FFFF,12 


Before Instruction After Instruction 


C Cc 
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APAC Add P Register to Accumulator APAC | 


Syntax [<label>] APAC 
Operands None 
Execution (PC) +17 PC 


(ACC) + (shifted P register) ~ ACC 


Affects OV; affected by PM and OVM. 
Affects C (TMS320C25). 
Not affected by SXM. 


Encoding 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 =O 
1 1 0 0 1 1 1 0 0 0 0 1 0 1 Oo #1 


Description The contents of the P register are shifted as defined by the PM status bits 
and added to the contents of the accumulator. The result is left in the ac- 
cumulator. APAC is not affected by the SXM bit of the status register; the 
P register is always sign-extended. 


The APAC instruction is a subset of the LTA, LTD, MAC, MACD, MPYA, 
and SORA instructions. 


Words 1 


Cycles 
2o[ + {| 1 | wp | tp | - | - 
cos{_ + | 1 | tp | te [| « | 7 | 
‘20 
'C25 

Example APAC (PM = 0) 


Before Instruction After Instruction 


nec 52] ace [80 


C C 
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B Branch Unconditionally_ 


Syntax 


Operands 


Execution 


Encoding 


ie] 


[<label>] B <pma>[,{ind}[,<next ARP>]] 


O < pma < 65535 
O < next ARP < 7 


pma ~ PC 
Modify AR(ARP) and ARP as specified. 


15 14 13 12 1110 9 8 7 6 5 4 3 2 1 «0 


1 1 1 1 1 1 1 1 1 See Section 4.1 
Program Memory Address 


Description The current auxiliary register and ARP are modified as specified, and control 


Words 
Cycles 


. Example 


passes to the designated program memory address (pma). Note that no 
AR or ARP modification occurs if nothing is specified in those fields. Pma 
can be either a symbolic or a numeric address. 


No 


eee ee ee 
2+p (ext-to-int) 2+2p (ext-to-ext) 


True Conditions: 
Destination on-chip RAM: 
2 2 


‘20 


‘C25 


2+2p 
Destination on-chip ROM: 
3 3 3+2p 
Destination external memory: 
3+p 3+p 3+3p 
False Condition: 


Destination anywhere: 
2 2 2+2p 2 2 


Cycle Timings for a Repeat Execution 


‘20 


"C25 not repeatable 
B PRG191 191 is loaded into the program counter, and 
the program continues running from that 
location. 
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BACC Branch to Address Specified by Accumulator BACC 


Syntax [<label>] BACC 
Operands None 


Execution (ACC(15-0)) > PC 


Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 1 #0 
1 1 #0 0 141 4 4 O O O 14 0 0 4 OQ 1 


Description The branch uses the lower half of the accumulator (bits 15-0) for the 
branch address. 


Words 1 
Cycles 


Cycle Timings for a Single Instruction 


PI/DI PI/DE PE/DI PE/DE PR/DI PR/DE 
ee Se Se ee ee 


’C25 Destination on-chip RAM: 
2 2 2+p 2+p 2 2 
Destination on-chip ROM: 
3 3 3+p 3+p 3 3 
Destination external memory: 
3+p 3+p 3+2p 3+p 
Cycle Timings for a Repeat Execution 
‘20, Ss otrepeatable =| CT 
‘025 
Example BACC 
Before Instruction After Instruction 


PC >16E4 PC >9545 
ACC >F7FF9545 ACC >F7FF9545 


Cc C 
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BANZ Branch on Auxiliary Register Not Zero BANZ 


Syntax [<label>] BANZ <pma>[,{ind}[,<next ARP>]] 


Operands O < pma s 65535 
O < next ARP < 7 


Execution If AR(ARP) # 0: 
Then pma ~ PC; 
Else (PC) + 2 > PC. 
Modify AR(ARP) as specified. 


Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 1 #90 


1 1 1 1 1 0 1 1 1 See Section 4.1 
Program Memory Address 


Description Control is passed to the designated program memory address (pma) if the 
current auxiliary register is not equal to zero. Otherwise, control passes to 
the next instruction. The current auxiliary register and ARP are also modi- 
fied as specified. 


The current auxiliary register is either incremented or decremented from zero 
when the branch is not taken. Note that the AR modification defaults to 
*- (decrement current AR by one) when nothing is specified, making it 
compatible with the TMS320C1x. Pma can be either a symbolic or a nu- 
meric address. : 


Words 
Cycles 


No 


Cycle Timings for a Single Instruction 


‘20 2 (br int-to-int) 2+p (int-to-ext) 
2+p (ext-to-int) 2+2p (ext-to-ext) 
‘C25 | True Conditions: 
Destination on-chip RAM: 
2 2 


2+2p 
Destination on-chip ROM: 
3 3 3+2p 
Destination external memory: 
3tp 3+p 3+3p 
False Condition: 


Destination anywhere: 
2+2p 2 2 


Cycle Timings for a Repeat Execution 
20]  inotrepeatable =| CCT 


'C25 | not repeatable 
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BANZ Branch on Auxiliary Register Not Zero BANZ 


Example 1 BANZ PRG35,*- 


w 
@ 
=< 
[o) 
ion 
oO 
> 
77) 
a+ 
c 
Vv 
={3. 
° 
5 
> 
+ 
oO 
ben J 
> 
77) 
~-s 
= 
5 
VPs 
ols 
3 


AR AR 
PC >46 PC >35 
or 
PC >46 PC >48 
Example 2 BANZ PRG64, *+ 
Before Instruction After Instruction 
PC Pe =64 
or 
AR AR 
PC PC 


Note: 


BANZ is designed for loop control using the auxiliary registers as loop 
counters. Using *0+ or *0- allows modification of the loop counter by 


a variable step size. Care must be exercised when doing this, however, 
because the auxiliary registers behave as modulo 65536 counters, and 
zero may be passed without being detected if ARO > 1. 


BBNZ Branch on Bit Not Equal to Zero BBNZ 


Syntax 
Operands 


Execution 


Encoding 


Description 


Words 
Cycles 
‘20 
"C25 
‘20 
"C25 
Example 


[<label>] BBNZ <pma>[,{ind}[,<next ARP>]] 


0 < pma <s 65535 
O < next ARP < 7 


If test/control (TC) status bit = 1: 
Then pma > PC; 
Else (PC) + 2 > PC. 
Modify AR (ARP) and ARP as specified. 


Affected by TC. 
15 14 13 12 1110 9 8 7 6 5 4 3 2 1 #90 


1 1 1 1 1 0 0 1 1 See Section 4.1 
Program Memory Address 


The current auxiliary register and ARP are modified as specified. Control 
then passes to the designated program memory address if TC = 1. Other- 
wise, control passes to the next instruction. Note that no AR or ARP 
modification occurs if nothing is specified in those fields. Pma can be either 
a symbolic or a numeric address. Note that the TC bit may be affected by 
the BIT, BITT, CMPR, LST1, NORM, RTC, and STC instructions. 


2 


2 (br int-to-int) 2+p (int-to-ext) oe te 
2+p (ext-to-int) 2+2p (ext-to-ext) 


True Conditions: 
Destination on-chip RAM: 
2 2 


2+2p 2+2p 2 2 
Destination on-chip ROM: 
3 3 3+2p 3+2p 3 3 
Destination external memory: 
3+p 3+p —3+3p 3+3p 3+p 3+p 


False Condition: 
Destination anywhere: 
2 2 2+2p 2 2 


Cycle Timings for a Repeat Execution 
not repeatable [- | - 
not repeatable 


BBNZ PRG650 © If TC = 1, 650 is loaded into the program 
counter; otherwise, the program counter 
is incremented by 2. 
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BBZ Branch on Bit Equal to Zero BBZ 
Syntax [<label>] BBZ <pma>[,{ind}[,<next ARP>]] 
Operands 0 < pma < 65535 
O < next ARP < 7 
Execution If test/control (TC) status bit = 0: 


Then pma > PC; 
Else (PC) + 2 > PC. 
Modify AR(ARP) and ARP as specified. 


Affected by TC bit. 
Encoding 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 #0 


Ca a oe ee ee ee See Section 4.1 
Program Memory Address 


Description The current auxiliary register and ARP are modified as specified. Control 
then passes to the designated program memory address if TC = 0. Other- 
wise, control passes to the next instruction. No AR or ARP modification 
occurs if nothing is specified in those fields. Pma can be either a symbolic 
or a numeric address. Note that the TC bit is affected by the BIT, BITT, 
CMPR, LST1, NORM, RTC, and STC instructions. 


Words | 2 
Cycles 


2+p (ext-to-int) 2+2p (ext-to-ext) 
’C25 | True Conditions: 
Destination one RAM: 


2+2p 
Destination on-chip ROM: 

3 3 3+2p 
Destination external memory: 
3+p 3+p 3+3p 

False Condition: 


Destination anywhere: 
2 2 2+2p 2 2 


Cycle Timings for a Repeat Execution 
20|  Cotrepeatable CT CCE 
"C25 


Example BBZ PRG325 If TC = 0, 325 is loaded into the program 
counter; otherwise, the program counter 
is incremented by 2. 
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BC Branch on Carry (TMS320C25 BC 


Syntax [<label>] BC <pma>[,{ind}[,<next ARP>]] 


Operands 0 < pma < 65535 
O < next ARP <7 


Execution If carry bit C = 1: 
Then pma > PC; 
Else (PC) + 2 > PC. 
Modify AR(ARP) and ARP as specified. 


Affected by C. 
Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 1 =O 


0 1 0 1 1 1 1 0 1 See Section 4.1 
Program Memory Address 


Description The current auxiliary register and ARP are modified as specified. Control 
then passes to the designated program memory address if the carry bit C 
1. Otherwise, control passes to the next instruction. Note that no AR or 
ARP modification occurs if nothing is specified in those fields. Pma can 
be either a symbolic or a numeric address. 


Note that the carry bit C is affected by all add, subtract, and accumulate 
instructions as well as the ABS, LST1, NEG, RC, SC, rotate, and shift in- 
structions. The carry bit is not affected by execution of BC, BNC, or non- 
arithmetic instructions. 


Words 2 


‘Cycles 
Cycle Timings for a Single Instruction 


‘C25 | True Conditions: 
Destination on-chip RAM: 
2 2 2+2p 
Destination on-chip ROM: 
3 3 3+2p 


Destination external memory: 
3+p 3+p 3+3p 
False Condition: 
Destination anywhere: 
2 2 


2+2p 


Cycle Timings for a Repeat Execution 
‘C28 


Example BC PRG512 If the carry bit C = 1, 512 is loaded into 
the program counter; otherwise, the PC is 
incremented by 2. 
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Branch if Accumulator 


BGEZ Greater Than or Equal to Zero BGEZ 
Syntax [<label>] BGEZ <pma>[,{ind}[,<next ARP>]] 
Operands 0 < pma s 65535 
O < next ARP < 7 
Execution If (ACC) > 0: 


Then pma ~ PC; 
Else (PC) + 2 > PC. 
Modify AR (ARP) and ARP as specified. 


Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 1 =O 


1 1 1 1 0 1 0 0 1 See Section 4.1 
Program Memory Address 


Description The current auxiliary register and ARP dre modified as specified. Control 
then passes to the designated program memory address (pma) if the con- 
tents of the accumulator are greater than or equal to zero. Otherwise, 
control passes to the next instruction. Note that no AR or ARP modification 
occurs if nothing is specified in those fields. Pma can be either a symbolic 
or a numeric address. 


Words 2 
Cycles 


2 (br int-to-int) 2+p (int-to-ext) ree 
2+p (ext-to-int) 2+2p (ext-to-ext) 


True Conditions: 
Destination on-chip RAM: 
2 2 


‘20 


‘C25 


2+2p 2+2p 2 2 
Destination on-chip ROM: 
3 3 3+2p 3+2p 3 3 
Destination external memory: 
3+p 3+p 3+3p 3+3p 3+p 3+p 


False Condition: 
Destination anywhere: 
2 2 2+2p 2 2 


Cycle Timings for a Repeat Execution 
not repeatable hoe aor 


‘20 


'C25 not repeatable 
Example BGEZ PRG217 217 is loaded into the program counter if 
the accumulator is greater than or equal 
to zero. 
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BGZ Branch if Accumulator Greater Than Zero BGZ 


Syntax 
Operands 


Execution 


Encoding 


Description 


Words 
Cycles 
‘20 
'C25 
‘20 
‘C25 
Example 


[<label>] BGZ <pma>[,{ind}[,<next ARP>]] 


0 < pma < 65535 
O < next ARP < 7 


lf (ACC) > 0: 
Then pma ~ PC; 
Else (PC) + 2 > PC. 
Modify AR(ARP) and ARP as specified. 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 #90 


1 1 1 1 0 0 0 1 1 an See Section 4.1 
Program Memory Address 


The current auxiliary register and ARP are modified as specified. Control 
then passes to the designated program memory address (pma) if the con- 
tents of the accumulator are greater than zero. Otherwise, control passes 
to the next instruction. Note that no AR or ARP modification occurs if 
nothing is specified in those fields. Pma can be either a symbolic or a nu- 
meric address. 


2 


2 (br int-to-int) 2+p (int-to-ext) ae ae 
2+p (ext-to-int) 2+2p (ext-to-ext) 


True Conditions: 
Destination on-chip RAM: 
2 2 


2+2p 2+2p 2 2 
Destination on-chip ROM: 
3 3 3+2p 3+2p 3 3 
Destination external memory: 
3+p 3+p 3+3p 3+3p 3+p 3+p 
False Condition: 
Destination anywhere: 
2+2p 


2 2 2 2 


Cycle Timings for a Repeat Execution 
not repeatable = . 
not repeatable 


BGZ PRG342 342 is loaded into the program counter if 
the accumulator is greater than zero. 
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BIOZ Branch on 1/O Status Equal to Zero BIOZ 


Syntax [<label>] BIOZ <pma>[,{ind}[,<next ARP>]] 


Operands 0 < pma < 65535 
O < next ARP <7 


Execution If BIO = 0: 
Then pma ~ PC; 
Else (PC) + 2 > PC. 
Modify AR(ARP) and ARP as specified. 


Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 1 90 


1 1 1. 4 1 0 1 i¢) 1 See Section 4.1 
Program Memory Address 


Description The current auxiliary register and ARP are modified as specified. Control 
then passes to the designated program memory address (pma) if the BIO 
pin is low. Otherwise, contro! passes to the next instruction. Note that no 

_ AR or ARP modification occurs if nothing is specified in those fields. Pma 
can be either a symbolic or a numeric address. 


BIOZ in conjunction with the BIO pin can be used to test if a peripheral is 
ready to send or receive data. Polling the BIO pin using BIOZ may be pre- 
ferable to an interrupt when executing time-critical loops. 


Words 2 
Cycles 


PE/DI 
‘20 2 (br int-to-int) 2+p (int-to-ext) ia ae 
2+p (ext-to-int) 2+2p (ext-to-ext) 
‘C25 | True Conditions: 


Destination on-chip RAM: 
2 2 2+2p 


Destination on-chip ROM: 
3 


3+2p 
Destination external memory: 
3+p 3+p 3+3p 
False Condition: 


Destination anywhere: 
2 2 2+2p 2 2 


Cycle Timings for a Repeat Execution 
20{ notrepeatable | CO COT 
'C25 not repeatable 


Example BIOZ PRG64 If the BIO- pin is active (low), then 
a branch to location 64 occurs. 
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BIT Test Bit BIT 


Syntax 
_ Direct: [<label>] BIT <dma>,<bit code> 
Indirect: [<label>] BIT {ind},<bit code>[,<next ARP>] 


Operands O < dma s 127 
O < next ARP < 7 
O < bit code < 15 


Execution (PC): + 1. PC 
(dma bit at bit address (15-bit code)) > TC. 


Affects TC. 
Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 1 =O 


Direct:} 1 0 0 1]  BitCode | Data Memory Address 
Indirect:| 1 0 0 1 { BitCode | 1) See Section 4.1 


Description The BIT instruction copies the specified bit of the data memory value to the 
TC bit of status register ST1. Note that the BITT, CMPR, LST1, and NORM 
instructions also affect the TC bit in status register ST1. A bit code value 
is specified that corresponds to a certain bit address in the instruction, as 
given by the following table: 


Bit Code 
Bit Address 1110 9 8 
(LSB) 0 yl fa a 
1 1110 
2 1101 
3 1100 
4 1.0. 7-4 
5 1010 
6 1001 
7 1000 
8 0111 
9 0110 
10 0101 
11 0100 
12 001 1 
13 0010 
14 0001 
(MSB) 15 0000 
Words 1 
Cycles 


Cycle Timings for a Single Instruction 


zof[_1 | 2a | ‘+p | 2+dep | - | - 1 

cas[_1 | 2a | 1+p | 2+dep | 1 | 2a | 
20 {a | 2n+nd_[_nép | 2ntndvp | - | 
c25 [__n | t+nend [| _nep | t+nendep| n _ | tenend_| 
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BIT 


Example BIT >0,>8 
or 
BIT *,8 
Data 
Memory 
>F400 
TC 
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Test Bit BIT 


(DP = 488) 


If current auxiliary register contains >F400. 


Before Instruction - After Instruction 
Data 
>7E98 Memory >7E98 
>F400 
TC | 


_— — 


BITT Test Bit Specified by T Register BITT 


Syntax 
Direct: [<label>] BITT <dma> 
Indirect: [<label>] BITT {ind}[,<next ARP>] 


Operands 0 < dma < 127 
QO < next ARP <7 


Execution (PC) +1 7PC 
(dma bit at bit address (15-T register(3-0))) + TC 
Affects TC. 


Encoding 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 90 


Direct:}0 1 0 1 0 1 14 +140] Data Memory Address 
4 Ot 4 0 eS See Section 4.1 


Description The BITT instruction copies the specified bit of the data memory value to 
the TC bit of status register ST1. Note that the BIT, CMPR, LST1, and 
NORM instructions also affect the TC bit in status register ST1. The bit 
address is specified by a bit code value contained in the LSBs of the T 
register, as given in the following table: 


Indirect: 


Bit Code 
Bit Address 32190 
(LSB) 0 1111 
1 1110 
2 y ie fas © a 
3 1100 
4 1011 
5 1010 
6 1001 
7 1000 
8 011 1 
9 0110 
10 0101 
11 0100 
12 001 1 
13 0010 
14 0001 
(MSB) 15 0000 
Words 1 
Cycles 


20{__1 | 2a [+p | zap | - | - | 
co _1 | 2a | 1+» | 2+dep | 1 | 2ea 
20{__» | 2nend | nep | anendvp | - | - | 
cas [mn | t+ntnd | nep | t+nend+p| n | tentnd | 
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Cite i a ees 


BITT Test Bit Specified by T Register BITT 


Example BITT >0 Value in T register points to bit 14 of 
data word (DP = 240). 
or 
BITT * If current auxiliary register contains >7800. 
Before Instruction After Instruction 
Data Data 
Memory >4DC8 Memory >4DC8 
>7800 >7800 
TR TR 
TC TC 
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Branch if Accumulator 


BLEZ Less Than or Equal to Zero BLEZ 
Syntax {<label>] BLEZ <pma> [,{ind}[, <next ARP>]] 
Operands 0 < pma < 65535 
O < next ARP <7 
Execution If (ACC) < 0: 


Then pma > PC; 
Else (PC) + 2 > PC. 
Modify AR(ARP) and ARP as specified. 


Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 1 =O 


1 1 1 1 0 0 1 (0) 1 See Section 4.1 
Program Memory Address ; 


Description The current auxiliary register and ARP are modified as specified. Control 
then passes to the designated program memory address (pma) if the con- 
tents of the accumulator are Jess than or equal to zero. Otherwise, control 
passes to the next instruction. Note that no AR or ARP modification occurs 
if nothing is specified in those fields. Pma can be either a symbolic or a 
numeric address. 


Words 2 
Cycles 


2+p (ext-to-int) 2+2p (ext-to-ext) 


True Conditions: 
Destination on-chip RAM: 
2 2 2+2p 2+2p 2 2 


Destination on-chip ROM: 
3 3 


20 


"C25 


3+2p 3+2p 3 3 
Destination external memory: 
3+p 3t+p 3+3p 3+3p 3+p 3+p 


False Condition: 
Destination anywhere: 
2 2 2+2p 2 2 


Cycle Timings for a Repeat Execution 


‘20 not repeatable Sa ae 
1025 
Example BLEZ PRG63 63 is loaded into the program counter if 
the accumulator is less than or equal to 
zero. 
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Block Move 


BLKD from Data Memory to Data Memory BLKD 


Syntax 
Direct: 
Indirect: 


Operands 


Execution 


Encoding 
Direct: 


Indirect: 


Description 
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[<label>] BLKD <dmai>,<dma2> 
[<label>] BLKD <dma1>,{ind}[,<next ARP>] 


0 < dma < 65535 
O < dma2 < 127 
O < next ARP < 7 


TMS32020: 


(PC) +2-TOS 
dmai ~ PC 


lf (repeat counter) # 0: 

Then (dma1, addressed by PC) ~.dmaz2, 
Modify AR(ARP) and ARP as specified, 
(PC) +1 PC, 

(repeat counter) - 1 > repeat counter. 


Else (dma1, addressed by PC) ~ dma2 
Modify AR(ARP) and ARP as specified. 
(TOS) ~ PC 


TMS320C25: 


(PC) +2>PC 
(PFC) > MCS 
dmal > PFC 


If (repeat counter) # 0: 

Then (dmai1, addressed by PFC) > dma2, 
Modify AR(ARP) and ARP as specified, 
(PFC) + 1 > PFC, 

(repeat counter) - 1 > repeat counter. 


Else (dma1, addressed by PFC) ~ dma2 
Modify AR(ARP) and ARP as specified. 
(MCS) ~ PFC 


15 14 13 12 1110 9 8 7 6 5 4 3 2 1 «0 


1 1 1 1 1 1 0 1 Lo | Data Memory Address 2 


Data Memory Address 1 


ee a oe oe oe ee See Section 4.1 
Data Memory Address 1 


Consecutive memory words are moved from a source data memory block 
to a destination data memory block. The starting address (lowest) of the 
source block is defined by the second word of the instruction. The starting 
address of the destination block is defined by either the dma contained in 
the opcode (for direct addressing) or the current AR (for indirect address- 
ing). In the indirect addressing mode, both the current AR and ARP may 
be modified in the usual manner. In the direct addressing mode, dma2 is 
used as the destination address for the block move but is not modified upon 


Block Move 


BLKD from Data Memory to Data Memory BLKD 


Words 
Cycles 


‘20 


"C25 


20 


"C25 


repeated executions of the instruction. Thus, the contents of memory at the 
dmaz2 address will be the same as the contents of memory at the last dma1 
address in a repeat sequence. RPT or RPTK must be used with the BLKD 
instruction, in the indirect addressing mode, if more than one word is to be 
moved. The number of words to be moved is one greater than the number 
contained in the repeat counter RPTC at the beginning of the instruction. 
At the end of this instruction, the RPTC contains zero and, if using indirect 
addressing, AR(ARP) will be modified to contain the address after the end 
of the destination block. Note that the source and destination blocks do 
NOT have to be entirely on-chip or off-chip. However, BLKD cannot be 
used to transfer data from a memory-mapped register to any other location 
in data memory. 


The PC points to the instruction following BLKD after execution. Interrupts 
are inhibited during a BLKD operation used with RPT or RPTK. 


The BLKD instruction on the TMS32020 uses one level of stack. Therefore, 
the value on the bottom of the stack is lost since the stack is pushed and 
popped during the instruction operation. 


2 


Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/DI PE/DE PR/DI PR/DE 


Data source internal:t 
3 3+d 3+2p 3+d+2p 

Data source external:t 
3+d 4+2d 3+d+2p 4+2d+2p 


Source data in on-chip RAM: 
3+d 3+2p 3+d+2p 


Source data in external memory: 
4+d 4+2d 4+d+2p 4+2d+2p 


Data source internal:t 
2+n 2+n+nd 2+n+2p 2+n+nd+2p 
Data source external:t 
2+n+nd 2+2n+2nd 2+nt+nd 2+2n+2nd 
+2p +2p 


Source data in on-chip RAM: 
2+n 2+n+nd 2+n+2p 2+nt+nd+2p 2+n 2+n+nd 
Source data in external memory: 
3+nt+nd 2+2n+2nd 3+n+nd 2+2n+2nd 3+n+nd 2+2n+2nd 
+2p +2p 


tColumn headings ‘DI/DE’ refer to data destination. 
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Block Move 


BLKD from Data Memory to Data Memory BLKD 
Example RPTK 2 
BLKD >F400,*+ If current auxiliary register contains 1030. 
dmat 
Before Instruction After Instruction 
Data Data 
Memory >7F98 Memory >7F98 
62464 62464 
Data Data 
. Memory >FFE6 Memory >FFE6 
62465 62465 
Data Data | 
Memory >9522 Memory >9522 
62466 62466 
dma2 
Before Instruction After Instruction 
Data Data 
Memory >8DEE Memory >7F98 
1030 
Data Data 
Memory >9315 Memory >FFE6 
1031 1031 
Data Data 
Memory >2531 Memory >9522 
1032 032 


Block Move 


BLKP from Program Memory to Data Memory BLKP 
Syntax 


Direct: [<label>] BLKP <pma>,<dma> 
Indirect: [<label>] BLKP <pma>,{ind}[,<next ARP>] 


Operands O < pma < 65535 
0 < dma <s 127 
O < next ARP <7 


Execution TMS32020: 


(PC) +27 TOS 
pma ~ PC 


If (repeat counter) # 0: 

Then (pma, addressed by PC) > dma, 
Modify AR(ARP) and ARP as specified, 
(PC) +1. > PC, 

(repeat counter) - 1 > repeat counter. 


Else (pma, addressed by PC) ~ dma 
Modify AR(ARP) and ARP as specified. 
(TOS) 7 PC 


TMS320C25: 


(PC) +2 PC 
(PFC) > MCS 
pma ~ PFC 


If (repeat counter) ¥ 0: 

Then (pma, addressed by PFC) ~ dma, 
Modify AR(ARP) and ARP as specified, 
(PFC) +1 > PFC, 

(repeat counter) - 1 ~ repeat counter. 


Else (pma, addressed by PFC) ~ dma 
Modify AR(ARP) and ARP as specified. 
(MCS) ~ PFC 


Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 1 #0 
0 


Direct: | 1 1 1 1 1 1 0 Data Memory Address 
1 1 1 1 


Program Memory Address 


Indirect:| 11 OO See Section 4.1 
Program Memory Address 


Description Consecutive memory words are moved from a source program memory 
: block to a destination data memory block. The starting address (lowest) 

of the source block is defined by the second word of the instruction. The 

starting address of the destination block is defined by either the dma con- 

tained in the opcode (for direct addressing) or the current AR (for indirect 

addressing). In the indirect addressing mode, both the ARP and the current 

AR may be modified in the usual manner. in the direct addressing mode, 

dma is used as the destination address for the block move but is not modi- 
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Block Move 


BLKP from Program Memory to Data Memory BLKP 


Words 
Cycles 
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fied by repeated executions of the instruction. Thus, the contents of mem- 
ory at the dma address will be the same as the contents of memory at the 
last pma address in a repeat sequence. RPT or RPTK must be used with the 
BLKP instruction if more than one word is to be moved. The number of 
words to be moved is one greater than the number contained in the repeat 
counter RPTC at the beginning of the instruction. At the end of this in- 
struction, the RPTC contains zero and, if using indirect addressing, 
AR(ARP) will be modified to contain the address after the end of the des- 
tination block. Note that source and destination blocks do NOT have to 
be entirely on-chip or off-chip. 


The PC points to the instruction following BLKP after execution. Interrupts 
are inhibited during a BLKP operation. 


The BLKD instruction on the TMS32020 uses one level of stack. Therefore, 
the value on the bottom of the stack is lost since the stack is pushed and 
popped during the instruction operation. 


If the MP/MC pin on the TMS320C25 is low at the time of execution of this 
instruction and the program memory address used is less than 4096, an 
on-chip ROM location will be read. 


2 


Cycle Timings for a Single Instruction 
Pi/DI PI/DE PE/DI PE/DE PR/DI PR/DE 


Program source internal:t 
3 3+d_ 3+d+2p 
Program source external:t 
3+p 4+d+p 4+d+3p 
Table in on-chip RAM: 
3: 3+d 4+d+2p 
Table in on-chip ROM: 
4 4+d 4442p 
Table in external memory: 
4+p 4+d+p 4+d+3p 


Program source internal:t 
2+n 2+n+nd 2+n+2p 2+n+nd+2p 
Program source external:t 
2t+ntnp. 2+2nt+nd 2+nt+np 2+2n+nd+np 
+np +2p +2p 


Table in on-chip RAM: 
2+n 2+n+nd 3t+n+2p 34+n+nd+2p 3+n+nd 
Table in on-chip ROM: : 
3+n 3+n+nd 3tn+2p 3+n+nd+2p 3+n+nd 
Table in external memory: 
3+n+np 2+2n+nd 3+n+np 2+2n+nd+np 3+n+np 2+2n+nd 
+np +2p +2p +np 


tColumn headings ‘DI/DE’ refer to data destination. 


Block Move 


BLKP from Program Memory to Data Memory BLKP 
Example RPTK 2 


BLKP 65120,*+ If current auxiliary register contains 2048. 


pma 
Before Instruction After Instruction 

Program Program 

Memory _>A089 _ Memory >A089 
65120 65120 

Program Program 

Memory >2DCE Memory _ >2DCE 
65121 65121 

Program Program 

Memory >3A9F Memory >3A9F 

65122 65122 

dma 
Before Instruction After Instruction 

Data Data 

Memory >1234 Memory >AQ89 
2048 2048 
Data Data 

Memory >2005 Memory >2DCE 
2049 2049 
Data Data 

Memory >E98C Memory >S3A9F 
2050 2050 
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BLZ Branch if Accumulator Less Than Zero BLZ 


Syntax [<label>] BLZ <pma>[,{ind}[,<next ARP>]] 


Operands O < pma s 65535 
O < next ARP <7 


Execution lf (ACC) < 0: 
Then pma > PC; 
Else (PC) + 2 ~ PC. 
Modify AR(ARP) and ARP as specified. 


Encoding 15: 14 13:-72, 17.. 10. -9' 8&8 7..6. 58 4 3.2 #1,-0 


1 1 1 1 0 0 1 1 1 See Section 4.1 
Program Memory Address 


Description The current auxiliary register and ARP are modified as specified. Control 
then passes to the designated program memory address (pma) if the con- 
tents of the accumulator are less than zero. Otherwise, control passes to the 
next instruction. Note that no AR or ARP modification occurs when no- 
thing is specified in those fields. Pma can be either a symbolic or a numeric 


address. 
Words 2 
Cycles 
Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/DI PE/DE PR/DI PR/DE 
‘20 2 (br int-to-int) 2+p (int-to-ext) 
2+p (ext-to-int) 2+2p (ext-to-ext) 
'C25 | True Conditions: 
Destination on-chip RAM: 
2 2+2p 2+2p 2 2 
Destination on-chip ROM: 
3 3 3+2p 3+2p 3 3 
Destination external memory: 
3+p 3+p 3+3p 3+3p 3+p 3+p 
False Condition: 
Destination anywhere: 
2 2 2+2p 2 2 
Cycle Timings for a Repeat Execution 
20[—notrepeatabie CdS SC CCC” 
‘C25 not repeatable ; 
Example BLZ PRG481 481 is loaded into the program counter if 


the accumulator is less than zero. 
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BNC Branch on No Carry (TMS320C25) BNC 


e 


Syntax [<label>] BNC <pma>[,{ind}[,<next ARP>]] 


Operands 0 < pma s 65535 
O < next ARP <7 


Execution If carry bit C = 0: 
Then pma > PC; 
Else (PC) + 2 ~ PC, 
Modify AR(ARP) and ARP as specified. 


Affected by C. 
Encoding 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 #90 


0 1 0 1 1 1 1 1 1 See Section 4.1 
Program Memory Address 


Description The current auxiliary register and ARP are modified as specified. Control 
then passes to the designated program memory address if the carry bit C = 
0. Otherwise, contro! passes to the next instruction. Note that no AR or 
ARP modification occurs when nothing is specified in those fields. Pma 
can be either a symbolic or a numeric address. 


Note that the carry bit C is affected by all add, subtract, and accumulate 
instructions as well as the ABS, LST1, NEG, RC, SC, rotate, and shift in- 
structions. The carry bit is not affected by execution of the BC, BNC, or 
nonarithmetic instructions. 


Words 2 
Cycles 


Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/DI PE/DE PR/DI PR/DE 


‘C25 | True Conditions: 
Destination on-chip RAM: 
2 2 2+2p 


Destination on-chip ROM: 
3 3 


3+2p 
Destination external memory: 
3+p 3+p 3+3p 
False Condition: 


Destination anywhere: 
2 2 2+2p 


Cycle Timings for a Repeat Execution 
‘C25 


Example BNC PRG325 If the carry bit C = 0, 325 is loaded into 
the program counter. Otherwise, the PC is 
incremented by 2. 
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BNV 


Syntax 
Operands 
Execution 
Encoding 
Description 
Words 
Cycles 
20 
'C25 
‘20 
‘C25 
Example 
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Branch if No Overflow BNV 


[<label>] BNV <pma>[,{ind}[,<next ARP>]] 


0 < pma < 65535 
O < next ARP < 7 


If overflow OV status bit = 0: 

Then pma > PC; 

Else (PC) + 2 ~ PC and 0 > OV. 
Modify AR(ARP) and ARP as specified. 


Affects OV; affected by OV. 
15 14 13 12 1110 9 8 7 6 5 4 3 2 1 90 


1 1 1 1 0 1 1 1 1 See Section 4.1 
Program Memory Address 


The current auxiliary register and ARP are modified as specified. Control 
then passes to the designated program memory address (pma) if the OV 
(overflow fiag) is clear. Otherwise, the OV is cleared, and control passes 
to the next instruction. Note that no AR or ARP modification occurs if 
nothing is specified in those fields. Pma can be either a symbolic or a nu- 
meric address. 


2 


2+p (ext-to-int) 2+2p (ext-to-ext) 


True Conditions: 
Destination on-chip RAM: 
2 2 


2+2p 2+2p 2 2 

Destination on-chip ROM: 

3 3 3+2p 3+2p 3 3 
Destination external memory: 
3+p 3+p 3+3p 3+3p 3+p 3+p 

False Condition: 

Destination anywhere: 

2 2 2+2p 


2 2 
Cycle Timings for a Repeat Execution 


not repeatable a ee 
not repeatable 


BNV PRG315 315 is loaded into the program counter 
if the overflow flag is clear. OV is 
cleared. 


BNZ Branch if Accumulator Not Equal to Zero BNZ 


Syntax [<label>] BNZ <pma>[,{ind}[,<next ARP>]] 


Operands O < pma < 65535 
O < next ARP < 7 


Execution lf (ACC) # 0: 
Then pnia ~ PC; 
Else (PC) + 2 > PC. 
Modify AR(ARP) and ARP as specified. 


Encoding 15.14. 493 12.71. 10 9 8 7 6 5 4 3.2 1 0 


1 1 1 1 0) 1 0 1 1 See Section 4.1 
Program Memory Address 


Description The current auxiliary register and ARP are modified as specified. Control 
then passes to the designated program memory address (pma) if the con- 
tents of the accumulator are not equal to zero. Otherwise, control passes 
to the next instruction. Note that no AR or ARP modification occurs tf 
nothing is specified in those fields. Pma can be either a symbolic or a nu- 
meric address. 


Words 2 
Cycles 


oe ae ee en 
2+p (ext-to-int) 2+2p (ext-to-ext) 


True Conditions: 
Destination on-chip RAM: 
2 2 


‘20 
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2+2p 2+2p 2 2 
Destination on-chip ROM: 
3 3 3+2p — 3+2p 3 3 
Destination external memory: 
3+p 3t+p 3+3p 3+3p 3+p 3+p 


False Condition: 
Destination anywhere: 
2 


2+2p 2 2 
[______notrepeatable Sd dO 


Example — BNZ PRG320 320 is loaded into the program counter 
if the accumulator does not equal zero. 


‘20 
"C25 
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BV 


Syntax 
Operands 


Execution 


Encoding 


Description 


Words 
Cycles 


Example 
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Branch on Overflow BV 


[<label>] BV <pma>[,{ind}[,<next ARP>]] 


0 < pma s 65535 
O < next ARP < 7 


If overflow (OV) status bit = 1: 
Then pma ~ PC and O > OV; 
Else (PC) + 2 > PC. 
Modify AR(ARP) and ARP as specified. 


Affects OV; affected by OV. 
15 14 13 12 1110 9 8 7 6 5 4 3 2 1 =O 


1 1 1 1 0 0 (0) 0 1 See Section 4.1 
Program Memory Address 


The current auxiliary register and ARP are modified as specified, and the 
overflow flag is cleared. Control passes to the designated program memory 
address (pma) if the OV (overflow flag) is set. Otherwise, control passes 
to the next instruction. Note that no AR or ARP modification occurs if 
nothing is specified in those fields. Pma can be either a symbolic or a nu- 
meric address. 


2 


2 (br int-to-int) 2+p (int-to-ext) a 
2+p (ext-to-int) 2+2p (ext-to-ext) 


True Conditions: 
Destination on-chip RAM: 
2 


2+2p 2+2p — 2 2 
Destination on-chip ROM: 
3 3 3+2p 3+2p 3 3 
Destination external memory: 
3+p 3+p 3+3p 3+3p 3+p 3+p 


False Condition: 
Destination anywhere: 
2 2 2+2p 2 


2 
Cycle Timings for a Repeat Execution Z 
not repeatable oe ee 
not repeatable 


BV PRG610 If an overflow has occurred since the over- 
flow flag was last cleared, then 610 is 
loaded in the program counter. OV is 
cleared. 


BZ Branch if Accumulator Equals Zero B 


Syntax 
Operands 


Execution 


Encoding 


Description 


Words 
Cycles 
‘20 
'C25 
‘20 
‘C25 


Example 


N 


[<label>] BZ <pma>[,{ind}[,<next ARP>]] 


O < pma < 65535 
O < next ARP <7 


If (ACC) = 0: 

Then pma ~ PC; 

Else (PC) + 2 > PC. 
Modify AR(ARP) and ARP as specified. 


15 14 13 12 1110 9 8 7 6 5 4 3 2 1 #0 


1 1 1 1 0) 1 1 0 1 See Section 4.1 
Program Memory Address 


The current auxiliary register and ARP are modified as specified. Control 
then passes to the designated program memory address (pma) if the con- 
tents of the accumulator are equal to zero. Otherwise, control passes to the 
next instruction. Note that no AR or ARP modification occurs if nothing is 
specified in those fields. Pma can be either a symbolic or a numeric ad- 
dress. 


N 


2 (br int-to-int) 2+p (int-to-ext) eo e. 
2+p (ext-to-int) 2+2p (ext-to-ext) 


True Conditions: 
Destination on-chip RAM: 


2 2 2+2p 2+2p 2 2 
Destination on-chip ROM: 

3 3 3+2p 3+2p 3 3 
Destination external memory: 
3+p 3+p 3+3p 3+3p 3+p 3+p 


False Condition: 
Destination anywhere: 
2 2 2+2p 2+2p 2 2 


eo Cycle Timings for a Repeat Execution 
FSCSCnotrepeatable Td SCY 


not repeatable 


BZ PRG102 102 is loaded into the program counter if 
the accumulator is equal to zero. 
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CALA Call Subroutine Indirect CALA 


Syntax [<label>] CALA 

Operands None 

Execution (PC) +17 TOS 
(ACC(15-0)) > PC 


Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 1 =O 
1 41 0 0O 4 4 4 0 0O 0 4 O O 4 O O 


Description The current program counter is incremented and pushed onto the top of the 
stack. Then, the contents of the lower half of the accumulator are loaded 
into the PC. The carry bit on the TMS320C25 is unaffected by this in- 
struction. 


The CALA instruction is used to perform computed subroutine calls. 
Words 1 . 


Cycles 
Pup 
1 [ea iy Eee ed RR, cc eee ee 
"C25 Destination on-chip RAM: 
2 2+p 2+p 2 2 
Destination on-chip ROM: 
3 3 3+p 3+p 3 3 
Destination external memory: 
3+p 3+p 3+2p 
20[7_notrepeatble SCY SCS 
‘C25 
Example CALA 
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CALA Call Subroutine Indirect CALA 


Before Instruction After Instruction 


ACC 


Stack 
(20) 


Stack 
(C25) 
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CALL 


Syntax 
Operands 


Execution 


Encoding 


Description 


Words 
Cycles 


Example 
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Call Subroutine CALL 


[<label>] CALL <pma>[,{ind}[,<next ARP>]] 


O < pma < 65535 
O < next ARP < 7 


(PC) -:2°> TOS 
pma ~ PC 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 =0 


1 1 1 1 1 1 1 0 1 See Section 4.1 
Program Memory Address 


The current auxiliary register and ARP are modified as specified, and the 
PC (program counter) is incremented by two and pushed onto the top of 
the stack. The specified program memory address (pma) is then loaded into 
the PC. Note that no AR or ARP modification occurs if nothing is specified 
in those fields. Pma can be either a symbolic or a numeric address. 


N 


eee Ee ee ee 
2+p (ext-to-int) 2+2p (ext-to-ext) 


True Conditions: 
Destination on-chip RAM: 
2 


2+2p 2+2p 2 2 
Destination on-chip ROM: 
3 3 3+2p 3+2p 3 3 
Destination external memory: 
3+p 3+p 3+3p 3+3p 3+p 3t+p 


False Condition: 
Destination anywhere: 
2 2 2+2p 2 2 


Cycle Timings for a Repeat Execution 
[~C~Cotrepeatabie ECCS 


CALL PRG109 


CALL 


PC 


Stack 
(20) 


Stack 
(C25) 


Call Subroutine 


Before Instruction 
>33 


>71 
>48 
>16 
>80 


PC 


Stack - 


(20) 


Stack 
(C25) 


CALL 


After Instruction 
>6D 


>35 
. >7I 
— >48 
>16 
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CMPL Complement Accumulator CMPL 


Syntax [<label>] CMPL 
Operands None 
Execution (PC) +17 PC 
(ACC) > ACC 
Encoding 15 14 13 12 11 10 9 8 .7 : 6 5 4 3 2 1 =O 


1 1 0 60600 1 1 1 0 0 0 1 0 0 1 1 1 


Description The contents of the accumulator are replaced with its logical inversion 
(one’s complement). 


Words 1 
Cycles 


ae el ETS Ee 
SA (eae a 
Example — CMPL 


Before Instruction After Instruction 


C Cc 
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Compare Auxiliary Register 


CMPR with Auxiliary Register ARO CMPR 
Syntax [<label>] CMPR <constant> 
Operands O<CM<s3 
Execution (PC) SPC 
Compare AR(ARP) to ARO, placing result in TC bit of status register ST1. 
Affects TC. 
Not affected by SXM; does not affect SXM. 
Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 1 +90 


Description 


Words 
Cycles 


‘20 
‘C25 


'20 
"C25 


Example 


The CMPR instruction performs the following comparisons dependent on 
the value of CM: 


lf CM = 00, test if AR(ARP) = ARO 
lf CM = 01, test if AR(ARP) < ARO 
If CM = 10, test if AR(ARP) > ARO 
lf CM = 11, test if AR(ARP) # ARO 


If the result of a test is true, a one is loaded into the TC status bit. Other- 
wise, TC is loaded with a zero. The auxiliary registers are treated as un- 
signed integers in the comparison. , 


—_ 


Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/DI PE/DE PR/DI PR/DE 


ee a Ei ee ee Se ee 
ee eS ee ee 
a ee eee ee ee ee 
oon en I. anes fe nb. oe 
CMPR 2 (ARP = 4) 
Before Instruction After Instruction 
TC Tc 
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CNFD Configure Block as Data Memory CNED 


Syntax 
Operands 


Execution 


Encoding 


Description 


Words 
Cycles 


‘20 
’C25 


‘C25 


Example 
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[<label>] CNFD 
None 


(PC) +41 PEC 
0 ~ RAM configuration control (CNF) status bit 


Affects CNF. 


15 14 13 12 1110 9 8 7 6 5 4 3 2 1 #90 
1 #14 0 0 4 4 tt 0 8 O08 0 0 0 4 O 0 


On-chip RAM block 0 is configured as data memory. The block is mapped 
to locations 512 through 767 in data memory. This instruction is the 
complement of the CNFP instruction and sets the CNF bit in status register 
ST1 to a zero. CNF is also loaded by the CNFP and LST1 instructions. 


On the TMS32020, the instruction fetch immediately following a CNFD or 
CNFP instruction uses the old CNF value. The second fetch uses the new 
CNF value, even if it is the fetch of the second word of a two-word in- 
struction. 


On the TMS320C25, the next two instruction fetches immediately follow- 
ing a CNFD or CNFP instruction use the old value of CNF. 


1 


[1 [1 i [me f - | - 
ot ee ee 
a OS 
ee ot ne 
CNFD A zero is loaded into the CNF status bit, thus 


configuring block BO as data memory (see 
memory maps in Section 3.4). 


CNFEP Configure Block as Program Memory CNFEP 


Syntax 
Operands 


Execution 


Encoding 


Description 


Words 
Cycles 
‘20 
'C25 
‘20 
‘C25 
Example 


{[<label>}] CNFP 
None 


(PC) +17 PC 
1 > RAM configuration control (CNF) status bit 


Affects CNF. 


15 14 13 12 1110 9 8 7 6 5 4 3 2 1 90 
1 41 0 0 14 4 4 0 0 0 0 080 OO 4 0 1 


On-chip RAM block 0 is configured as program memory. The block is 
mapped to locations 65280 through 65535 in program memory space. This 
instruction is the complement of the CNFD instruction and sets the CNF 
bit in status register ST1 to a one. CNF is also loaded by the CNFD and 
LST1 instruction. 


Configuring this block as program memory allows the use of the program 
counter as an address generator to access data from on-chip RAM. Used 
in conjunction with the repeat instructions, this allows two data memory 
locations to be addressed simultaneously, one from the auxiliary registers 
and one from the program counter. Instructions that take advantage of this 
feature are the MAC, MACD, BLKD, and BLKP instructions. 


On the TMS32020, the instruction fetch immediately following a CNFD or 
CNFP instruction uses the old CNF value. The second fetch uses the new 
CNF value, even if it is the fetch of the second word of a two-word in- 
struction. 


On the TMS320C25, the next two instruction fetches immediately follow- 
ing a CNFD or CNFP instruction use the old value of CNF. 


1 


Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/DI PE/DE PR/DI PR/DE 


Cycle Timings for a Repeat Execution 


CNFP The CNF bit is set to a logic 1, thus config- 
uring block BO as program memory (see memory 
maps in Section 3.4) 
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DINT Disable Interrupt DINT 


Syntax 
Operands 


Execution 


Encoding 


Description 


Words 
Cycles 
‘20 
"C25 
‘20 
"C25 
Example 
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[<label>] DINT 
None 


(PC) + 1° = PC 
1 > interrupt mode (INTM) status bit 


Affects INTM. 


1S: 1413-592 41 10S: . BF. OB BS A BD TO 
1 1 0 0 1 1 1 0 0 0 0 0 0 0 Oo 1 


The interrupt mode (INTM) status bit is set to logic 1. Maskable interrupts 
are disabled immediately after the DINT instruction executes. Note that the 
LST instruction does not affect INTM. 


The unmaskable interrupt, RS, is not disabled by this instruction, and the 
interrupt mask register (IMR) is unaffected. Interrupts are also disabled by 
a reset. 


1 


Cycle Timings for a Single Instruction 


I a el 
Et el ee ee 
Cycle Timings for a Repeat Execution 
er ae ee ee ek ee 
a ae A a (ae a ee 


DINT Maskable interrupts are disabled, and INTM is 
set to one. . 


DMOV Data Move in Data Memory DMOV 


Syntax 
Direct: 
Indirect: 


Operands 


Execution 


Encoding 
Direct: 


Indirect: 


Description 


‘C25 


[<label>] DMOV <dma> 
[<label>] DMOV {ind}[,<next ARP>] 


QO < dma < 127 
O < next ARP < 7 


(PC) 7-1 C 
(dma) ~ dma + 1 


Affected by CNF. 
15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 #90 


0 1 0 1 0 1 1 0 Data Memory Address 
ae oe oe See Section 4.1 


The contents of the specified data memory address are copied into the 
contents of the next higher address. DMOV works only within the on-chip 
data RAM blocks BO, B1, and B2. It works within block BO if it is config- 
ured as data memory, and the data move function is continuous across the 
boundaries of blocks BO and B1; ie., it works for locations 512 to 1023. 
The data move function cannot be used on external data memory. If used 
on external data memory or memory-mapped registers, DMOV will read the 
specified memory location but will perform no other operations. 


When data is copied from the addressed location to the next, higher !o- 
cation, the contents of the addressed location remain unaltered. 


The data move function is useful in implementing the z”! delay encountered 
in digital signal processing. The DMOV function is included in the LTD and 
MACD instructions (see the LTD and MACD instructions for more infor- 
mation). 


Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/DI PE/DE PR/DI PR/DE 


[on | tenend | np [tenendep | | tenend | 
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DMOV Data Move in Data Memory DMOV 


Example DMOV DATS (DP=4) 
or 


DMOV ~ If current auxiliary register contains 520. 


Before Instruction After Instruction 


Data Data 
Memory >43 Memory >43 
520 520 


Data 


Data 
Memory Memory >43 
521 521 
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EINT Enable Interrupt EINT 


Syntax [<label>] EINT 
Operands None 


Execution (PC) +1—74PC 
O > interrupt-mode (INTM) status bit 


Affects INTM. 


Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 1 #0 
1 1 oO O 1 1 1 0 0 O08 0 0 0 0 0 0 


Description The interrupt-mode flag (INTM) in the status register is cleared to logic 0. 
Maskable interrupts are enabled after the instruction following EINT exe- 
cutes. This allows an interrupt service routine to re-enable interrupts and 
execute a RET instruction before any other pending interrupts are pro- 
cessed. Note that the LST instruction does not affect INTM. (See the DINT 
instruction for further information.) : 


Words 1 


PUDE 
1 [1 te Te TO 
Lt ot i ie 
7 [ome oe 
A 


Example EINT Unmasked interrupts are enabled, and INTM is 
set to zero. 


Cycles 


‘20 
‘C25 


‘20 
‘C25 
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FORT Format Serial Port Registers FORT 


Syntax [<label>] FORT <constant> 
Operands Constant = 0 or 1 


Execution (PO) ele PC 
Constant ~ format (FO) status bit 


Affects FO. 


Encoding 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 +90 
1 1 0 0 1 #1 4 #0 0 0 0 0 1 =1 = 1 [FO] 


Description The format (FO) status bit is loaded by the instruction with the LSB speci- 
fied in the instruction. The FO bit is used to control the formatting of the 
transmit and receive shift registers of the serial port. If FO = 0, the registers 
are configured to receive/transmit 16-bit words. If FO = 1, the registers are 
configured to receive/transmit 8-bit bytes. FO is set to zero on a reset. 


Words 1 


Cycle Timings for a Single Instruction 


Example FORT 1 The FO status bit is loaded with 1, making 
- the bit length of the serial port 8 bits. 
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IDLE Idle Until Interrupt IDLE 


Syntax [<label>] IDLE 

Operands None 

Execution TMS32020: 
(PC) +4-> PC 
TMS320C25: 
(PC) +17 PC 


O > interrupt mode (INTM) status bit 
Affects INTM. 


Encoding 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 =0 
1 #14 0 0 14 #4 #4 O O O O 14 #4 4 4 1 


Description The IDLE instruction forces the program being executed to wait until an 
interrupt or reset occurs. The PC is incremented only once, and the device 
remains in an idle state until interrupted. On the TMS32020, the INTM bit 
must be set to zero in order for the maskable interrupts to be recognized. 
On the TMS320C25, INTM is automatically set to zero. Execution of the 
IDLE instruction causes the TMS320C25 to enter the powerdown mode 
(see Section 3.6.7). 


Words 1 
Cycles 


Cycle Timings for a Single Instruction 


‘20 | 1 (min waits for INT) | 1+p(minwaitsforINT) {  - [| - | 
"C25 ({nterrupt) destination on-chip ROM: 


3 (min waits for INT) 


(interrupt) destination external memory: 
3+2p (min waits for INT) 


Cycle Timings for a Repeat Execution 
not repeatable Pots ead 
not repeatable 


Example IDLE The processor idles until a reset or unmasked 
interrupt occurs. 


‘20 
'C25 
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IN Input Data from Port IN 


Syntax 


Direct: [<label>] IN <dma>,<PA> 
Indirect: [<label>] IN {ind},<PA>[,<next ARP>] 


Operand 


Ss 


Execution 


Encoding 


Direct: 


0 < dma s 127 
O < next ARP < 7 
O < port address PA < 15 


(PC): 1 PC 

Port address ~ address bus A3-A0O 
O > address bus A15-A4 

Data bus D15-DO ~ dma 


15 14 13 12 11 10 9 


Port Address Data Memory Address 


Indirect: qs Os Oe 0. Port Address ea See Section 4.1 


Descript 


Words 
Cycles 


Example 
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ion 


‘20 
’C25 


‘20 
‘C25 


The IN instruction reads a 16-bit value from one of the external |/O ports 
into the specified data memory location. The IS line goes low to indicate 
an I/O access, and the STRB, R/W, and READY timings are the same as for 
an external data memory read. 


1 


[isi | _2+ani_| 2epri | tarps | - | - 
[neni [ansndeni [ 2nvpeni [Snengrpen] = [= | 


+ni 


IN STAT,PAS Read in word from peripheral on port 
address 5. Store in data memory 
location STAT. 


or 

LRLK ° 1,520 Load AR1 with decimal 520. 

LARP 1 Load ARP with decimal 520. 

IN *-,PA1,0 Read in word from peripheral on port 


address 1. Store in data memory 
location 520. Decrement AR1 to 519. 
Load the ARP with 0. 


LAC 


Syntax 


Direct: 
Indirect: 


Operands 


Execution 


Encoding 


Direct: 


Indirect: 


Description 


Words 
Cycles 


Example 


’20 


'C25 


‘20 
‘C25 


Load Accumulator with Shift LAC 


{[<label>] LAC <dma>[,<shift>] 
[<label>] LAC {ind}[,<shift>[,<next ARP>]] 


O < dma < 127 
O < next ARP <7 
0 < shift < 15 (defaults to 0) 


(PC) +17 PC 
(dma) x 2shift > acc 


If SXM = 1: 
Then (dma) is sign-extended. 
lf SXM = 0: 
Then (dma) is not sign-extended. 


Affected by SXM. 
15 14 13 12 1110 9 8 7 6 5 4 3 2 1 =0 


poo 4 of shit Data Memory Address 
aE a See Section 4.1 


The contents of the specified data memory address are left-shifted and 
loaded into the accumulator. During shifting, low-order bits are zero-filled. 
High-order bits are sign-extended if SXM = 1 and zeroed if SXM = 0. 


1 


[41 [aed [te ap | 
[7 arate | eae | | ee 


[anendep | - |__| 
[itnendtp | a toned 


LAC DAT6,4 (DP = 8) 


or 
LAC *,4 If current auxiliary register contains 1030. 
Before Instruction After Instruction 
Data Data 
Memory Memory 
1030 1030 


>1 
Cc Cc 


LACK 


Syntax 
Operands 


Execution 


Encoding 


Description 


Words 
Cycles 


Example 
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’20 
‘C25 


‘20 
'C25 


Load Accumulator Immediate Short LACK 


[<label>] LACK <constant> 
0 < constant < 255 

(PC) +17 PC 

8-bit positive constant ~ ACC 
Not affected by SXM. 


15 14 13 12 1110 9 8 7 6 5 4 3 2 1 +90 


The 8-bit constant is loaded into the accumulator right-justified. The upper 
24 bits of the accumulator are zeroed (i.e., sign extension is suppressed). 


1 


Ee le ae 
ee ee 
[rotrepeatable dT Cd 


LACK >15 


Before Instruction After Instruction 


C Cc 


Load Accumulator with 


LACT Shift Specified by T Register LACT 
Syntax 


Direct: [<label>] LACT <dma> 
Indirect: [<label>] LACT {ind}[,<next ARP> 


Operand 


Ss 


Execution 


Encoding 


Direct:| O 1 0 0 0 0 1 0 Data Memory Address 


Indirect: 


Descript 


Words 
Cycles 


Example 


ion 


‘20 
"C25 


20 
"C25 


0 < dma < 127 
O < next ARP <7 


(PC) 1. PC 
(dma) x 9T register(3-0) + ACC 


If SXM = 1: 
Then (dma) is sign-extended. 
If SXM = 0: 
Then (dma) is not sign-extended. 


Affected by SXM. 


15 14093 42-11. 10.9 8 7 6 6 A BS 2 910 


oto o o o 1 ols] seesectionat | 


The LACT instruction loads the accumulator with a data memory value that 
has been left-shifted. The left-shift is specified by the four LSBs of the T 
register, resulting is shift options from O to 15 bits. Using the T register’s 
contents as a shift code provides a variable shift mechanism. 


LACT may be used to denormalize a floating-point number if the actual 
exponent is placed in the four LSBs of the T register and the mantissa is 
referenced by the data memory address. Note that this method of denor- 
malization can only be used when the magnitude of the exponent is four 
bits or less. 


1 


(a Ge GE 
a RS 
[a [amend [ney | znendep | - | - 1 


Pn tenend | nap [tenendep | a tenend 
LACT DAT1 (DP = 6) 

or 

LACT * If current auxiliary register contains 769. 
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Load Accumulator with 


LACT Shift Specified by T Register LACT 
Before Instruction After Instruction 
Data Data 
Memory >1376 Memory >1376 
769 769 
ACC >98F7EC83 ACC >13760 
C C 
T >3014 T >3014 


LALK Load Accumulator Long Immediate with Shift _LALK 


Syntax 
Operands 


Execution 


Encoding 


[<label>] LALK <constant>[,<shift>] 


16-bit constant 
0 < shift < 15 (defaults to 0) 
(PC) +27 PC 
Constant x 28hift +> Acc 
If SXM = 1: 
Then -32768 < constant < 32767. 


if SXM = 0: 
Then 0 < constant < 65535. 


Affected by SXM. 
15 14 13 12 1110 9 8 7 6 5 4 3 2 1 «0 


16-Bit Constant 


Description 


Words 
Cycles 


‘20 
‘C25 


‘20 
‘C25 


Example 1 


Example 2 


The left-shifted 16-bit immediate value is loaded into the accumulator. The 
shifted 16-bit constant is sign-extended if SXM = 1; otherwise, the high- 
order bits of the accumulator (past the shift) are set to zero. Note that the 
MSB of the accumulator can only be set if SXM = 1 and a negative number 
is loaded. The shift count is optional and defaults to zero. 


Nh 


Cycle Timings for a Single Instruction 


P1/DI PI/DE PE/DI PE/DE PR/DI PR/DE 
2+2p | 242 | - _| 
2+2p 


Cycle Timings for a Repeat Execution 


not repeatable a ee 


not repeatable 


LALK >F794,8 (SXM=1) 
Before Instruction After Instruction 
ACC >12345678 ACC >FFF79400 
Cc Cc 
LALK >F794,8 (SXM=0) 


Before Instruction After Instruction 


Cc Cc 
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LAR Load Auxiliary Register LAR 
Syntax 
Direct: [<label>] LAR <AR>,<dma> 
Indirect: [<label>] LAR <AR>,{ind}[,<next ARP>] 

Operands 0 < dma s 127 

O < auxiliary register AR < 7 

O < next ARP < 7 
Execution (PG) +1 =PEC 

(dma) — auxiliary register AR 
Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 1 9 


Indirect:| 0 0 1 1 of ar | 1| See Section 4.1 


Description 


Words 
Cycles 


Example 1 


4-82 


‘20 
‘C25 


‘20 
‘C25 


The contents of the specified data memory address are loaded into the 
designated auxiliary register (AR). 


The LAR and SAR (store auxiliary register) instructions can be used to load 
and store the auxiliary registers during subroutine calls and interrupts. If 
an auxiliary register is not being used for indirect addressing, LAR and SAR 
enable the register to be used as an additional storage register, especially 
for swapping values between data memory locations without affecting the 
contents of the accumulator. 


1 


PR/DI 

|_| _2+a | tp | 2edep | - 1 
[4 era | tp | aed | 1 «dC 
P ntp | 2nendep |__|] 
[np | anenarp |» | 


LAR. ARO,DAT10 (DP = 4) 
Before Instruction After Instruction 
Data Data 
522 522 
ARO ARO >18 


LAR Load Auxiliary Register LAR 


Example 2 LARP AR4 
LAR  AR4,*- 


Before Instruction 
Data 


Memory 
617 


>32 
AR4 


Note: 


After Instruction 


Data 

Memory >32 
617 
AR4 >32 


LAR, in the indirect addressing mode, ignores any AR modifications if 


the AR specified by the instruction is the same as that pointed to by the 
ARP. Therefore, in Example 2, AR4 is not decremented after the LAR 


instruction. 
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LARK Load Auxiliary Register Immediate Short LARK 


Syntax [<label>] LARK <AR>,<constant> 


Operands 0 < constant < 255 
O < auxiliary register AR < 7 


Execution § (PC) +17 PC 
8-bit constant ~ auxiliary register AR 


Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 1 =O 


[7700 0] AR | —*BRConstome Cid 


Description The 8-bit positive constant is loaded into the designated auxiliary register 
(AR) right-justified and zero-filled (i.e., sign-extension suppressed). 


LARK is useful for loading an initial loop counter value into an auxiliary 
register for use with the BANZ instruction. 


= 
° 
al 
Q. 
a 
—_ 


Cycle Timings for a Single Instruction 
PI/DI P1/DE PE/DI PE/DE PR/DI PR/DE 


Cycle Timings for a Repeat Execution 
not repeatable ae ae 


‘C25 
Example LARK  ARO,>15 
Before Instruction After Instruction 


ARO ARO 
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LARP Load Auxiliary Register Pointer LARP 


Syntax 


Operands 


Execution 


Encoding 


Description 


Words 
Cycles 


‘20 
"C25 


‘20 
‘C25 


Example 


{<label>] LARP <constant> 
0 < constant < 7 


(PC) 1) = PC 
(ARP) ~ ARB 
Constant ~ ARP 


Affects ARP and ARB. 


15 14 13 12 1110 9 8 7 6 5 4 3 2 1 =«0 
oO 1 0 14 0 4 0 4 14 0 0 0 1 ARP 


The auxiliary register pointer is loaded with the contents of the three LSBs 
of the instruction (a 3-bit constant identifying the desired auxiliary register). 
The old ARP is copied to the ARB field of status register ST1. ARP can also 
be modified by the LST, LST1, and MAR instructions, as well as any in- 
struction that is used in the indirect addressing mode. 


The LARP instruction is a subset of MAR; i.e., the opcode is the same as 
MAR in the indirect addressing mode. The following instruction has the 
same effect as LARP: 


MAR *,<constant> 


Cycle Timings for a Single Instruction 
PE/DI . PE/DE {| PR/DI PR/DE.. 
ee ee ee 


a 

Cycle Timings for a Repeat Execution 
aE ey a ee Oe 
a ee ee ee ee ee 


LARP 1 Any succeeding instructions will use 
auxiliary register AR1 for indirect 
addressing. 
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LDP Load Data Memory Page Pointer LDP 
Syntax 


Direct: [<label>] LDP <dma> 
Indirect: [<label>] LDP {ind}[,<next ARP>] 


Operands 0 < dma < 127 
O < next ARP < 7 


Execution (PC) + 4->.PC 
Nine LSBs of (dma) > data page pointer register (DP) status bits 


Affects DP. 


Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 1 9 


Direct:| O 1 Oo 1 0 oO 1 Data Memory Address 
Indirect} 0 1 0 1 +0 0 1 of1 | See Section 4.1 


oO 


Description The nine LSBs of the contents of the addressed data memory location are 
loaded into the DP (data memory page pointer) register. The DP and 7-bit 
data memory address are concatenated to form 16-bit data memory ad- 
dresses. The DP may also be loaded by the LST and LDPK instructions. 


Cycles 
Cycle Timings for a Single Instruction 
Pi/DI PI/DE PE/DI PE/DE PR/DI PR/DE | 
Example LDP DAT127 (DP = 511) 
or 
LDP * If current auxiliary register contains 65535. 
Before Instruction After Instruction 
Data Data 
Memory >FEDC Memory >FEDC 
65535 65535 
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LDPK Load Data Memory Page Pointer Immediate LDPK 


Syntax [<label>] LDPK <constant> 
Operands O < constant < 511 
Execution (PC) PPC 
Constant ~ data memory page pointer (DP) status bits 
Affects DP. 
Encoding 1S WAH 43) U2 4g AO: OB) CF - “Be Bs Se TD 


Description The DP (data memory page pointer) register is loaded with a 9-bit constant. 
The DP and 7-bit data memory address are concatenated to form 16-bit 
direct data memory addresses. DP > 8 specifies external data memory. 
DP = 4 through 7 specifies on-chip RAM blocks BO or B1. Block B2 is 
located in the upper 32 words of page 0. DP may also be loaded by the 
LST and LDP instructions. 


Words 1 
Cycles 


Tr + te |p | - | 
Sieh ee ee a 
[not repeatable 


Example LDPK 64 The data page pointer is set to 64. 


‘20 
"C25 


‘20 
'C25 
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LPH Load High P Register LP 


—< 


Syntax 
Direct: [<label>] LPH <dma> 
Indirect: [<label>] LPH {ind}[,<next ARP>] 


Operands 0 < dma < 127 
O < next ARP < 7 


Execution (PC) +17 PC 
(dma) ~ P register(31-16) 


Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 1 90 


Direct: | 0 1 Oo 1 0 o 1 1 Data Memory Address 
Indirect:|}0 1 0 4 #0 0 1 141) See Section 4.1 


Description The P register high-order bits are loaded with the contents of data memory. 
The low-order P register bits are unaffected. 


The LPH instruction is particularly useful for restoring the high-order bits 
of the P register after subroutine calls or interrupts. 


Cycles 
Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/DI PE/DE PR/DI PR/DE 
1 . 
Example LPH DATO (DP = 4) 
or 
LPH * If current auxiliary register contains 512. 
Before Instruction After Instruction 
Data Data 
Memory >F79C Memory >F79C 
512 512 


P >30079844 P >F79C9844 
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LRLK Load Auxiliary Register Long Immediate LRLK 


Syntax 
Operands 


Execution 


Encoding 


Description 


{[<label>] LRLK <AR>,<constant> 


O < auxiliary register < 7 
0 < constant < 65535 


(PC) +2 PC 
Constant ~ AR 
Not affected by SXM; does not affect SXM. 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 =O 


fi 0 OL AR. P00 0 0 oO O00 
16-Bit Constant 


The 16-bit immediate value is loaded into the auxiliary register specified by 
the AR field. The specified constant must be an unsigned integer, and its 
value is not affected by SXM. 


Cycle Timings for a Single Instruction 


PI/DI Pl/DE PE/DI PE/DE PR/DI PR/DE 
———————————————————— 
Z 


ee 


Cycle Timings for a Repeat Execution 


not repeatable i ee 


not repeatable 


Example 


LRLK AR3 ,>3080 


Before Instruction After Instruction 


AR3 >7F80 AR3 >3080 
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LST Load Status Register STO LST 


Syntax 


Direct: 
Indirect: 


Operands 


Execution 


Encoding 


[<label>] LST <dma> 
[<label>] LST {ind}[,<next ARP>] 


O < dma < 127 
O < next ARP < 7 


(PC) +17 PC 
(dma) ~ status register STO 


Affects ARP, OV, OVM, and DP. 
Does not affect INTM or ARB. 


15 14 13 12 1110 9 8 7 6 5 4 3 2 1 =0 
0 


Direct:| 0 1 Oo 1 0 0 0 Data Memory Address 
1 0 1 0 0 0 


Indirect:| 0 See Section 4.1 


Description 


Example 1 


4-90 


‘20 
‘C25 


‘20 
‘C25 


| 


Status register STO is loaded with the addressed data memory value. Note 
that the INTM (interrupt mode) bit is unaffected by LST. ARB is also un- 
affected even though a new ARP is loaded. If a next ARP value is specified 
via the indirect addressing mode, the specified value is ignored. Instead, 
ARP is loaded with the value contained within the addressed data memory 
word. 


The LST instruction is used to load status register STO after interrupts and 
subroutine calls. The STO contains the status bits: OV (overflow flag) bit, 
OVM (overflow mode) bit, INTM (interrupt mode) bit, ARP (auxiliary reg- 
ister pointer), and DP (data memory page pointer). These bits were stored 
(by the SST instruction) in the data memory word as follows: 


16 14 13 12 8 7 6 5 4 3 2 1 0 


11 10 9 
om + [amy SP SCSC*d 


1 


ee ee 
ee 


ET ee ee ee 
Cycle Timings for a Repeat Execution 


LST cee i The data memory word addressed by the 
contents of auxiliary register ARO is 
loaded into status register STO, except 
for the INTM bit. Note that even though 
a next ARP value is specified, that value 
is ignored, and even though a new ARP is 
loaded, the old ARP is not loaded into ARB. 


LST Load Status Register STO LST 


Example 2 LST >60 


Data 
Memory 
96 
STO 
ST1 


LARP AR4 
LST t= 


Example 3 


Example 4 LARP 


LST 


(DP = 0) 


Before Instruction 


>2404 


>6E00 
>0580 


(AR4 = >3FF) 


>3FF 


>CE06 


>FCO4 
>E780 


ive] 
© 
= 
io) 
= 
© 
> 
: nN 
-- 
x 
Cc 
QO 
= 
ie) 
a 


(AR4 = >3FF) 


>3FF 


>EEO4 


>EE00O 
>F780 


ies] 
@o 
paral 
je) 
= 
© 
=} 
n 
+ 
on 
Cc 
oO 
=a 
fe) 
3 


Data 
Memory 
96 


STO 
ST1 


AR4 
Data 
Memory 
1023 
STO 

ST1 


AR4 

Data 
Memory 

1023 


STO 
ST1 


> 
4 
ma 
9°) 
tnd 
— 
n 
+ 
Gal 
Cc 
iz) 
S. 
io) 
p> 


>2404 


>2604 
>0580 


>3FE 
>CE06 


>CCO06 
>E780 


>3FE 
> EEO4 


>EEO4 
>F780 


> > 
a ee 
[nal Comal 
@ Oo 
= = 
= a 
n n 
+ > 
= ond 
Cc Cc 
Q Q 
=x S. 
° 1°) 
3 =] 


LST1 Load Status Register ST1 LST1 


Syntax 
Direct: [<label>] LST1 <dma> 
Indirect: [<label>] LST1 {ind}[,<next ARP>] 


Operands 0 < dma < 127 
O < next ARP < 7 


Execution (PC)-+-} =: PE 
(dma) ~ status register ST1 
(ARB) ~ ARP 


Affects ARP, ARB, CNF, TC, SXM, XF, FO, TXM, and PM. 
Affects C, HM, and FSM (TMS320C25) 


Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 1 90 


Direct:} 0 1 Oo 1 0 0 oO 1 Data Memory Address 
Indirect:|0 1 0 1 0 0 0 141] See Section 4.1 


Description Status register ST1 is loaded with the data memory value. The bits of the 
data memory value, which are loaded into ARB, are also loaded into ARP 
to facilitate context switching. Note that if a next ARP value is specified 
via the indirect addressing mode, the specified value is ignored. 


LST1 is used to load status bits after interrupts and subroutine calls. ST1 
contains the status bits: ARB (auxiliary register pointer buffer), CNF (RAM 
configuration control) bit, TC (test/control) bit, SXM (sign-extension 
mode) bit, XF (external flag) bit, FO (serial port format) bit, TXM (transmit 
mode) bit, and the PM (product register shift mode) bit. ST1 on the 
TMS320C25 also contains the status bits: C (carry) bit, HM (hold mode) 
bit, and FSM (frame synchronization mode) bit. On the TMS32020, bits 
5,6, and 9 are one’s. The bits loaded into status register ST1 from the data 
memory word are as follows: 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 


| ARB | cNF] TC [sxm] ct [11 |HMf | esmt | xr] Fo] TXM] PM _ 


TOn the TMS32020, bits 5, 6, and 9 are one’s. 


Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/DI PE/DE PR/DI PR/DE 


cos[ 1 | 24 | t+» | 2+dep | 1 | 2d 
Cycle Timings for a Repeat Execution 


20[_p } 2ntnd_T_nep | anendep |---| 
cas [on nep | 2néndep [a | antnd 
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LST1 Load Status Register ST1 LST1 


LARP 3 
LST1 alia 


Example 1 


Example 2 LSTl >61 


Data 
Memory 
97 
STO 
ST1 


LARP AR4 
LST1 *- 


Example 3 


AR4 


oe 
emo 
1 022" 


STO 
ST1 


LARP AR4 
LST1 *-,1 


Example 4 


AR4 
Data 
Memory 
1022 


STO 
ST1 


The data memory word addressed by the 


contents 
replaces 
register 


(DP = 0) 


Before Instruction 


Data 
>0580 Memory 
97 
>ACO00 STO 
>0581 ST1 
(AR4 = >3FE) 
Before Instruction 
ARA 
Data 
>4F90 Memory 
1022 
>FCO4 STO 
>E780 ST1 
(AR4 = >3FE) | 
Before Instruction 
ARA 
Data 
Memory 
1022 
>FEO4 STO >7E04 
>0593 ST1 


of auxiliary register AR3 
the status bits of status 


ST1, and AR3 is decremented. 


>0580 


>0C00 
>0580 


>3FD 


>4F90 


>5C04 
>4F90 


>3FD 


>6190 


> > > 
> pes => 
@O $4) oO 
= = = 
= = 3 
n n Ww 
-- -- - 
=x “ = 
Cc Cc Cc 
i?) Q i?) 
° ° ° 
s 3 =] 


>6190 
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LT Load T Register LT 


Syntax 
_ Direct: [<label>] LT <dma> 
Indirect: [<label>] LT {ind}[,<next ARP>] 


Operands 0 < dma < 127 
O < next ARP <7 


Execution (PG). + 1.7 °PC 
(dma) ~ T register 


Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 -2 1 =O 


Direct:} 0 O 1 1 1 1 0 O Data Memory Address 
Indirect:|}0 0 1 #1 #1 +1 +~0 of1 | See Section 4.1 


Description The T register is loaded with the contents of the specified data memory 
address (dma). The LT instruction may be used to load the T register in 
preparation for multiplication. See the LTA, LTD, LTP, LTS, MPY, MPYK, 
MPYA, MPYS, and MPYU instructions. 


Cycles 
Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/DI PE/DE PR/DI PR/DE 
Example LT DAT24 (DP = 8) 
or 
LT * If current auxiliary register contains 16048. 
Before Instruction After Instruction 
Data Data 
1048 1048 
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LTA Load T Register and Accumulate Previous Product LTA 


Syntax 


Direct: 
Indirect: 


Operands 


Execution 


Encoding 


Direct: 


Indirect: 


Descriptior 


Words 
Cycles 


Example 


‘20 
‘C25 


‘20 
"C25 


[<label>] LTA <dma> 
{<label>] LTA {ind}[,<next ARP>] 


O < dma < 127 
OQ < next ARP < 7 


(PC) +17 PC 
(dma) ~ T register 
(ACC) + (shifted P register) ~ ACC 


Affects OV; affected by OVM and PM. 
Affects C (TMS320C25). 


15 14 13 12 1110 9 8 7 6 5 4 3 2 1 #0 


0 0 1 1 1 1 0 1 Data Memory Address 
hl ee ee ee ee ee ee See Section 4.1 


The T register is loaded with the contents of the specified data memory . 
address (dma). The contents of the product register, shifted as defined by 
the PM status bits, are added to the accumulator, with the result left in the 
accumulator. 


The function of the LTA instruction is included in the LTD instruction. 


1 a 


Pt | ad | ip | tees | = 
[42d | top | 2vaep [dC 
[a | 2nend | ntp | anendep | - [= +d 
Pn tentnd [np [tenenatp | on | tenn 


LTA DAT36 (DP = 6, PM = 0) 
or 
LTA * If current auxiliary register contains 804. 
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LTA Load T Register and Accumulate Previous Product LTA 


Before Instruction After Instruction 
Data Data 
Memory >62 Memory 
804 804 
T T >62 
Acc ace [9 
Cc Cc 


Load T Register, Accumulate 


LTD Previous Product, and Move Data LTD 
Syntax 
Direct: [<label>] LTD <dma> 
Indirect: [<label>] LTD {ind}[,<next ARP>] 

Operands 0 < dma <s 127 
0 < next ARP <7 

Execution (PC) +17 PC 
(dma) ~ T register 
(dma) ~ dma + 1 
(ACC) + (shifted P register) ~ ACC 
Affects OV; affected by OVM and PM. 
Affects C (TMS320C25). 

encosing:: 15 14 13 12 11 10 9 

Description The T register is joaded with the contents of the specified data memory 
address (dma). The contents of the P register, shifted as defined by the 
PM status bits, are added to the accumulator, and the result is placed in the 
accumulator. The contents of the specified data memory address are also 
copied to the next higher data memory address. 
This instruction is valid for blocks B1 and B2, and is also valid for block 
BO if block BO is configured as data memory. The data move function is 
continuous across the boundary of blocks BO and B1, but cannot be used 
with external data memory or memory-mapped registers. This function is 
described under the instruction DMOV. Note that if used with external data 
memory, the function of LTD is identical to that of LTA. 

Words 1 

Cycles 


zof_1 | 2a | +p | 2aep | - | - | 
cos { 1 | aed | t+p_| 2edep | 1 | 20a 


20[__n | 2nend | nép | anendvp | - [> 
25 [-n | t+nend | nep[t+nena+p] n _| tenend | 


Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/DI PE/DE PR/DI PR/DE 


Cycle Timings for a Repeat Execution 
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Load T Register, Accumulate 


LTD Previous Product, and Move Data LTD 
Example LTD DAT126 (DP = 7, PM = 0) 
or 
LTD * If current auxiliary register contains 1022. 
Before Instruction After Instruction 
Data Data 
Memory 62 Memory 62 
1022 1022 


Data 


yey vey >62 
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Load T Register and Store 


LTP P Register in Accumulator LTP 
Syntax 


Direct: [<label>] LTP <dma> 
Indirect: [<label>] LTP {ind}[,<next ARP>] 


Operands O < dma < 127 
O < next ARP < 7 


Execution (PC) sei se PC 


(dma) ~ T register 
(shifted P register) ~ ACC 


Affected by PM. 
Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 141 =O 
0 


Direct:} 0 O 1 1 1 1 1 Data Memory Address 
Indirect} 0 0 1 #1 #14 +14 =~«1 ~«~0[1 | See Section 4.1 


Description The T register is loaded with the contents of the addressed data memory 
location, and the product register is stored in the accumulator. The shift at 
the output of the product register is controlled by the PM status bits. 


Words j 


Cycles 
Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/DI PE/DE PR/DI PR/DE 
2o[- 1 | 24d | 1 | 2+ | - [| - 
cos [1 | 24d | 1+p | 2edtp | 1 | 24d | 
Cycle Timings for a Repeat Execution 
20[n | 2ntnd | ntp | 2ntndep | - i - 
cos [mn | tentnd | n+p [t+ntndtp| mn | ttarnd_| 
Example LTP DAT36 (DP = 6, PM = O) 
or 
LTP * If current auxiliary register contains 804. 
Before Instruction After Instruction 
Data Data 
Memory Memory >62 
804 804 


4-99 


LTS Load T Register, Subtract Previous Product LTS 


Syntax 


Direct: [<label>] LTS <dma> 
Indirect: [<label>] LTS {ind}[,<next ARP>] 


Operand 


Ss 


Execution 


Encoding 


Indirect: 


Direct: 


Description 


Words 
Cycles 


Example 
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0 < dma < 127 

0 < next ARP < 7 

(PC). + > PC 

(dma) ~ T register 

(ACC) - (shifted P register) ~ ACC 


Affects OV; affected by PM and OVM. 
Affects C (TMS320C25). 


15 14 13 12 1110 9 8 7 6 5 4 3 2 1 +40 
1 


0.4 64 1 OO 1 Data Memory Address _ 


ee ee ee ee See Section 4.1 


The T register is loaded with the contents of the addressed data memory 
location. The contents of the product register, shifted as defined by the 
contents of the PM status bits, are subtracted from the accumulator. The 
result is left in the accumulator. . 


1 


Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/DI PE/DE PR/DI PR/DE 
Se ae a ee ee ek ee 


2d ee eae 
a 
[nf aan tp | anendip | - | 
[pn tenend [np tenendep | | 


LTS DAT36 (DP = 6, PM = 0) 
or 
LTS * If current auxiliary register contains 804. 
Before Instruction After Instruction 
Data Data 
Memory >62 Memory >62 
804 804 


MAC Multiply and Accumulate MAC 


Syntax 
Direct: [<label>] MAC <dma>,<pma> 
Indirect: [<label>] MAC <pma>,{ind}[,<next ARP>] 


Operands O < pma < 65535 
O < dma < 127 
O < next ARP < 7 


Execution TMS32020: 


(PC) +27 TOS 
(pma) ~ PC 


If (repeat counter) # 0: 
Then (ACC) + (shifted P register) ~ ACC, 
(dma) ~ T register, 
(dma) x (pma, addressed by PC) ~ P register, 
Modify AR(ARP) and ARP as specified, 
(PC) +17 >: PC, 
(repeat counter) - 1 ~ repeat counter. 
Else (ACC) + (shifted P register) ~ ACC 
(dma) ~ T register 
(dma) x (pma, addressed by PC) ~ P register 
Modify AR(ARP) and ARP as specified. 
(TOS) > PC 


Affects OV; affected by OVM and PM. 


TMS320C25: 
(PC) + 2-PC 
(PFC) > MCS 
(pma) ~ PFC 
If (repeat counter) # 0: 
Then (ACC) + (shifted P register) ~ ACC, 
(dma) ~ T register, 
(dma) x (pma, addressed by PFC) ~ P register, 
Modify AR(ARP) and ARP as specified, 
(PFC) +1 > PFC, 
(repeat counter) - 1 ~ repeat counter. 
Else (ACC) + (shifted P register) ~ ACC 
(dma) ~ T register 
(dma) x (pma, addressed by PFC) ~ P register 
Modify AR(ARP) and ARP as specified. 
(MCS) > PFC 


Affects C and OV; affected by OVM and PM. 


Encoding 156 14 13 12 11 10 9 8 7 6 5 4 3 2 1 #90 


Direct} 0 1 0 1 1 1 0 41/0] Data Memory Address _ 
Program Memory Address 


Indirect} 0 4 0 1 1 4 0 141 | See Section 4.1 
Program Memory Address 
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MAC Multiply and Accumulate MAC 


Description 


Words 
Cycles 


Example 


4-102 
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The MAC instruction multiplies a data memory value (specified by dma) 
by a program memory value (specified by pma). It also adds the previous 
product, shifted as defined by the PM status bits, to the accumulator. The 
data and program memory locations on the TMS320C25 may be any non- 
reserved, on-chip or off-chip memory locations. If the program memory is 
block BO of on-chip RAM, then the CNF bit must be set to one. On the 
TMS32020, data and program memory locations must reside on-chip. Note 
that on both devices, the upper eight bits of the program memory address 
should be set to >FF in order to address BO program RAM, and the upper 
six bits of dma should be set to 0 to address a location below 1024. When 
used in the direct addressing mode, the dma cannot be modified during 
repetition of the instruction. 


When the MAC instruction is repeated, the program memory address con- 
tained in the PC/PFC is incremented by one during its operation. This en- 
ables accessing a series of operands in memory. MAC is useful for long 
sum-of-products operations, since MAC becomes a single-cycle instruction 
once the RPT pipeline is started. 


2 


Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/DI PE/DE PR/DI PR/DE 
3 _|_WA | 3+ | WA | - | - 


Table in on-chip RAM: 
3 4+d 4+2p 5+d+2p 4 
Table in on-chip ROM: 
4 5+d 4+2p 5+d+2p 4 
Table in external memory: 
4+p 5+d+p 4+3p 5+d+3p 4+p 


Cycle Timings for a Repeat Execution 
Pate] N/A | 2én+2p | N/A | - | -__ 


Table in on-chip RAM: 


2+n 2+2n+nd 3+n+2p 34+2n+nd+2p 3+n 3+2n+nd 
Table in on-chip ROM: 
3+n 3+2n+nd 3tn+2p 3+2n+nd+2p 3+n 3+2n+nd 
Table in external memory: 
3+n+np 3+2n+nd 3+nt+np 3+2n+nd+np 3+n+np 3+2n+nd 
+np +2p +2p +np 
SPM 3 Select a shift-right-by-6 mode on PR output. 
CNFP Config block BO as program memory (>FFXX). 
LARP 1 Use AR1 to address block Bl. 
LRLK 1,768 Point to lowest location in RAM block Bl. 
RPTK 255 Compute 256 sum-of-product operations. 


MAC >FFOO,*+ Multiply/accumulate and increment ARI. 


MAC Multiply and Accumulate MAC 
The following example shows register and memory contents before and af- 
ter the third step repeat loop: 


Before Instruction After Instruction 


aR ani 
PC/PFC PC/PFC >FFO3 


Data Data 
Memory >23 Memory >23 
770 770 


Program Program 
Memory >FAAA Memory >FAAA 


65282 


P >458972 P > FFFF453E 
ACC >723EC41 acc |[o|[ >7250266 
c 
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MACD Multiply and Accumulate with Data Move _MACD 


Syntax 
Direct: [<label>] MACD <dma>,<pma> 
Indirect: [<label>] MACD <pma>,{ind}[,<next ARP>] 


Operands O < pma < 65535 
0 < dma < 127 
O < next ARP < 7 


Execution TMS32020: 


(PC) +27 TOS 
(pma) ~ PC 


If (repeat counter) # 0: 
Then (ACC) + (shifted P register) ~ ACC, 
(dma) ~ T register, 
(dma) x (pma, addressed by PC) ~ P register, 
Modify AR(ARP) and ARP as specified, 
(PC) + 1 > PC, 
(repeat counter) - 1 ~ repeat counter. 
Else (ACC) + (shifted P register) > ACC 
(dma) ~ T register 
(dma) x (pma, addressed by PC) ~ P register 
Modify AR(ARP) and ARP as specified. 
(TOS) > PC 


Affects OV; affected by OVM and PM. 


TMS320C25: 

(PC) +27 PC 

(PFC) ~ MCS 

(pma) ~ PFC 

If (repeat counter) # 0: 
Then (ACC) + (shifted P register) ~ ACC, 
(dma) ~ T register, : 
(dma) x (pma, addressed by PFC) ~ P register, 
Modify AR(ARP) and ARP as specified, 
(PFC) + 1 > PFC, 
(repeat counter) - 1 ~ repeat counter. 
Else (ACC) + (shifted P register) ~ ACC 
(dma) ~ T register , 
(dma) x (pma, addressed by PFC) ~ P register 
Modify AR(ARP) and ARP as specified. 

(MCS) ~ PFC 


Affects C and OV; affected by OVM and PM. 
Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 141 +0 


Direct:| 0 1 Oo 1 1 1 0 oO Lo | Data Memory Address 
Program Memory Address 


Indirect:}] 0 1 0 1 1 1 0 oO [ete | See Section 4.1 
Program Memory Address 
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MACD Multiply and Accumulate with Data Move  MACD 


Description The MACD instruction multiplies a data memory value (specified by dma) 
by a program memory value (specified by pma). It also adds the previous 
product, shifted as defined by the PM status bits, to the accumulator. The 
data and program memory locations on the TMS320C25 may be any non- 
reserved, on-chip or off-chip memory locations. If the program memory is 
block BO of on-chip RAM, then the CNF bit must be set to one. On the 
TMS32020, data and program memory locations must reside on-chip. Note 
that on both devices, the upper eight bits of the program memory address 
should be set to >FF in order to address BO program RAM, and the upper 
six bits of dma should be set to 0 to address a location below 1024. When 
used in the direct addressing mode, the dma cannot be modified during 
repetition of the instruction. If MACD addresses one of the memory- 
mapped registers or external memory as a data memory location, the effect 
of the instruction will be that of a MAC instruction (see the DMOV in- 
struction description). 


MACD functions in the same manner as MAC, with the addition of data 
move for block BO, B1, or B2. Otherwise, the effects are the same as for 
MAC. This feature makes MACD useful for applications such as convo- 
lution and transversal filtering. 


When the MACD instruction is repeated, the program memory address 
contained in the PC/PFC is incremented by one during its operation. This 
enables accessing a series of operands in memory. When used with RPT 
or RPTK, MACD becomes a single-cycle instruction once the RPT pipeline 
is started. 


Words 2 


Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/DI PE/DE PR/DI PR/DE 
ps NA T8429 PAP 


Table in on-chip RAM: 
3 4+d 4+2p 5+d+2p 4 5+d 


Table in on-chip ROM: 
4 5+d 4+2p 5+d+2p 4 5+d 
Table in external memory: 
4+p 5+d+p 4+3p 5+d+3p 


Cycle Timings for a Repeat Execution 
atm [N/A | 24nt2p | NWA | - | 


‘20 


‘C25 | Table in on-chip RAM: 
2+n 2+2n+nd 3+n+2p 3+2n+nd+2p 3+n 3+2n+nd 
Table in on-chip ROM: 
3+n 3+2n+nd 3+n+2p 3+2n+nd+2p 3+n 3+2n+nd 
Table in external memory: 
3+n+np 3+2n+nd 3+nt+np 3+2n+nd+np 3+n+np 3+2n+nd 


+np +2p +2p +np 
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MACD Multiply and Accumulate with Data Move MACD 


Example SPM 0 Select no shift mode on PR output. 
SOVM Set overflow mode. 
CNFP Config block BO as program memory (>FFXX). 
LARP 3 Use AR3 to address block Bl. 
LRLK 3,1023 Point to highest location in RAM block Bl. 
RPTK 255 Compute 1 sample of a length-256 
convolution. 


MACD >FFOO,*- Multiply/accumulate, shift data word in 
block Bl, and decrement AR3. 


The following example shows register and memory contents before and af- 
ter the third step repeat loop: 


Before Instruction After Instruction 
ARt ari 
PC/PFC >FFO2 PC/PFC >FFO3 
Data Data 
Memory >23 Memory >23 
1021 1021 
Data Data 
Memory >7FC Memory >23 
1022 1022 
Program Program 
Memory >FAAA Memory >FAAA 


65282 65282 
>458972 


>723EC41 


>FFFF453E. 
>76975B3 


5 3 
oO 
o[e] 


> 
oO 
ro) 
oO Lx] 


Note: 


The data move function for MACD can only occur within on-chip data 
RAM blocks BO, B1, and B2. 
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MAR Modify Auxiliary Register MAR 


Syntax 
Direct: 
Indirect: 


' Operands 


Execution 


Encoding 
Direct: 


Indirect: 


Description 


Words 
Cycles 


‘20 
‘C25 


‘20 
’C25 


Example 1 


Example 2 


[<label>] MAR <dma> 
{[<label>] MAR {ind}[,<next ARP>] 


O < dma < 127 
O < next ARP < 7 


(PC) +17 PC 
Modifies ARP, AR(ARP) as specified by the indirect addressing field 
(acts as a NOP in direct addressing). 


15 14 13 12 1110 9 8 7 6 5 4 3 2 1 #90 


0 1 0 1 0 1 (0) 1 Data Memory Address 
a a ee ee eee See Section 4.1 


The MAR instruction acts as a no-operation instruction in the direct ad- 
dressing mode. In the indirect addressing mode, the auxiliary registers and 
the ARP are modified; however, no use is made of the memory being refer- 
enced. MAR is used only to modify the auxiliary registers or the ARP. The 
old ARP is copied to the ARB field of status register ST1. Note that any 
operation that MAR performs can also be performed with any instruction 
that supports indirect addressing. ARP may also be loaded by an LST in- 
struction. 


In the direct addressing mode, MAR is a NOP. Also, the instruction LARP 
is a subset of MAR (i.e., MAR *,4 performs the same function as LARP 4). 


Cycle Timings for a Single Instruction 
Pl/DI PI/DE PE/DI PE/DE PR/DI PR/DE 


MAR #*,1 Load the ARP with 1 


Before Insjruction After Instruction 
MAR *- Decrement current auxiliary register (in this 
case, AR1) 
Before Instruction After Instruction 
AR1 >35 AR1 >34 
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MAR Modify Auxiliary Register MAR 


Example 3 MAR *+,5 Increment current auxiliary register (AR1) and 
load ARP with 5. 
Before Instruction After Instruction 
AR1 
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MPY Multiply MPY 


Syntax 
Direct: [<label>] MPY <dma> 
Indirect: [<label>] MPY {ind}[,<next ARP>] 


Operands 0 <s dma s 127 
O < next ARP < 7 


Execution (PC) +17 PC 
(T register) x (dma) ~ P register 


Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 1 =#«0 
0 


Direct:| 0 0O 1 1 , 0.6060 Data Memory Address 
Indirect:|0 0 1 #1 +4 +0 © of1 | See Section 4.1 


Description The contents of the T register are multiplied by the contents of the ad- 
dressed data memory location. The result is placed in the P register. 


Words 1 


Cycles 
Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/Di PE/DE PR/DI PR/DE 
Example MPY DAT13 (DP = 8) 
or 
MPY * If current auxiliary register contains 1037. 
Before Instruction After Instruction 
Data Data 
Memory Memory 
1037 1037 
T T 
P >36 P >2A 
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Multiply and Accumulate 
MPYA Previous Product (TMS320C25 _MPYA 


Syntax 
Direct: [<label>] MPYA <dma> 
Indirect: [<label>] MPYA {ind}[,<next ARP>] 


Operands O < dma < 127 
O < next ARP < 7 


Execution (PO) > PC 
(ACC) + (shifted P register) ~ ACC 
(T register) x (dma) ~ P register 


Affects C and OV; affected by OVM and PM. 
Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 1 #0 


Direct:| 0 O 1 1 1 Oo 1 0 Data Memory Address 
Indirect:|0 0 1 #1 #1 +0 14 «Off | __ See Section 4.1 


Description The contents of the T register are multiplied by the contents of the ad- 
dressed data memory location. The result is placed in the P register. The 
previous product, shifted as defined by the PM status bits, is also added to 
the accumulator. 


Cycle Timings for a Single Instruction 


Example MPYA DAT13 (DP = 6, PM = 0) 
or 
MPYA * If current auxiliary register contains 781. 
Before Instruction After Instruction 

Data Data 
781 781 
T an 
P >36 P >2A 


ace [see] = ace sa] 
Cc Cc 
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MPYK Multiply Immediate MPYK 


Syntax 
Operands 


Execution 


Encoding 


[<label>] MPYK <constant> 


-4096 < constant < 4095 
-212 < constant < 2!2 - 1 


(PC):+ 1. == PC 
(T register) x constant ~ P register 


Not affected by SXM. 
15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 =0 


1 0 1 13-Bit Constant 


Description The contents of the T register are multiplied by the signed, 13-bit constant. 


Words 
Cycles 


Example 


The result is loaded into the P register. The immediate field is right-justified 
and sign-extended before multiplication, regardless of SXM. 


1 


zof{_1 | 1 {tp | tp | - | - 
(oS TE (RS TE 
not repeatable 


MPYK -9 


‘20 
‘C25 


Before Instruction After Instruction 


P >2A P >FFFFFFC1 
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Multiply and Subtract 


MPYS Previous Product (TMS320C25) MPYS | 


Syntax 
Direct: [<label>] MPYS <dma> 
Indirect: [<label>] MPYS {ind}[,<next ARP>] 


Operands 0 < dma < 127 
O < next ARP < 7 


Execution (PC) +7 = PC 
(ACC) - (shifted P register) ~ ACC 
(T register) x (dma) ~ P register 


Affects C and OV; affected by OVM and PM. 
Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 1 =O 


Direct: | 0 0 1 1 1 0 1 1 Data Memory Address 
Indirect} 0 0 1 #4 +14 +0 1 =«441 4 See Section 4.1 


Description The contents of the T register are multiplied by the contents of the ad- 
dressed data memory location. The result is placed in the P register. The 
previous product, shifted as defined by the PM status bits, is also sub- 
tracted from the accumulator. 


Words 1 


Cycles 
Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/DI PE/DE PR/DI PR/DE ; 


cap] | td tee 2tdep | td 
Cycle Timings for a Repeat Execution 
25 [__a | tentnd | n+p [tentndep| mn | 1+nend | 
Example MPYS DAT13 (DP = 6, PM = 0) 
or 
MPYS * If current auxiliary register contains 781. 
Before Instruction After Instruction 
Data Data 
Memory Memory 
781 781 
T T 
P >36 P >2A 
Cc C 
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MPYU 


Syntax 


Multiply Unsigned (TMS320C25 MPYU 


Direct: [<label>] MPYU <dma> 
Indirect: [<label>] MPYU {ind}[,<next ARP>] 


Operand 


Ss 


Execution 


Encoding 


Indirect: 


Descript 


Words 
Cycles 


Example 


Direct: 


ion 


‘C25 


"C25 


O < dma < 127 
0 < next ARP < 7 


(PC) 1 = PC 
Unsigned (T register) x unsigned (dma) ~ P register 


16 14 13 12 1110 9 8 F7 6 5 4 3 2 14 +9 


a a ee ee ee ee ee . See Section 4.1 | 


The unsigned contents of the T register are multiplied by the unsigned 
contents of the addressed data memory location. The result is placed in the 
P register. Note that the multiplier acts as a 17 x 17-bit signed multiplier for 
this instruction, with the MSB of both operands forced to zero. 


The shifter at the output of the P register will always invoke sign-extension 
on the P register when PM = 3 (right-shift by 6 mode). Therefore, this shift 
mode should not be used if unsigned products are desired. ; 


The MPYU instruction is particularly useful for computing multiple-preci- 
sion products, such as when multiplying two 32-bit numbers to yield a 
64-bit product. 


1 


Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/DI PE/DE PR/DI PR/DE 
Pa [ate [tp | aarp | 1 iY 


Cycle Timings for a Repeat Execution 


pon tentnd | ont | ttntnd+p] on | ttntnd | 


MPYU DAT16 (DP = 4) 


or 
MPYU * If current auxiliary register contains 528. 
_ Before Instruction After Instruction 
Data Data 
Memory >FFFF Memory >FFFF 
528 528 


T | >FFFF T >FFFF 
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- I air en mena 


NEG Negate Accumulator NEG 


Syntax 
Operands 


Execution 


Encoding 


Description 


Words 
Cycles 
‘20 
"C25 
‘20 
"C25 
Example 
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[<label>] NEG 
None 


(PC) +17 PC 
(ACC) x -1 * ACC 


Affects OV; affected by OVM. 
Affects C (TMS320C25). 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 =O 
1 1 0 0 14 4 4 0 0 O 41 O08 OF O 1 1 


The contents of the accumulator are replaced with its arithmetic comple- 
ment (two’s complement). The OV bit is set when taking the NEG of 
>80000000. If OVM = 1, the-accumulator contents are replaced with 
>7FFFFFFF. If OVM = 0, the result is >80000000. The carry bit C on the 
TMS320C25 is reset to zero by this instruction for all nonzero values of the 
accumulator, and set to one if the accumulator equals zero. 


1 


Cycle Timings for a Single Instruction 
ee ae ae ee ee ee 
es Re eae RS ee ae ee i ee es 

Cycle Timings for a Repeat Execution 

a a ee a ee ee eee 

ee Te ee ee ee ee 


NEG 


Before Instruction After Instruction 


ace i) L=trreraa} ace [| [____>00e | 
c C 


NOP No Operation NOP 


Syntax [<label>] NOP 
Operands None 
Execution (PC) + 1 °> PC 


Encoding 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 #0 
Oo 1 oO 1 0 1 0 14 0 0 0 0 0 0 0 0 


Description No operation is performed. The NOP instruction affects only the PC. NOP 
functions in the same manner as the MAR instruction in the direct ad- 
dressing mode; NOP has the same opcode as MAR in the direct addressing 
mode with dma = 0. 


The NOP instruction is useful as a pad or temporary instruction during 
program development. 


Words 1 
Cycles 
Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/DI PE/DE PR/DI PR/DE 


Example NOP 
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NORM 


Syntax 


Operands 


Execution 


Encoding 


Description 


Words 
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Normalize Contents of Accumulator NORM 


[<label>] NORM (TMS32020) 
[<label>] NORM {ind} (TMS320C25) 


None 


TMS32020: 
(PC) +1—7PC 


lf (ACC(31)).XOR.(ACC(30)) = 0: 
Then TC > 0, 
(ACC) x 2 ~ ACC, 
Modify AR(ARP) as specified; 
Else TC > 1. 


Affects TC; affected by TC. 


TMS320C25: 
(PC) +1 PC 


If (ACC) = 
Then TC > i 
Else, if (ACC(31 )).XOR. (ACC(30)) = 
Then TC ~ 0, 
_ (ACC) x 2 ~ ACC, 
Modify AR(ARP) as specified; 
Else TC > 1. 


Affects TC; affected by TC. 
15 14 13 12 1110 9 8 7 6 5 4 3 2 1 #0 


1 1 0 oO 1 1 1 0 11 Modify AR 0 0 1 O 


The NORM instruction is provided for normalizing a signed number that ts 
contained in the accumulator. Normalizing a fixed-point number separates 
it into a mantissa and an exponent. To do this, the magnitude of a sign- 
extended number must be found. ACC bit 31 is exclusive-ORed with ACC 
bit 30 to determine if bit 30 is part of the magnitude or part of the sign ex- 
tension. If they are the same, they are both sign bits, and the accumulator 
is left-shifted to eliminate the extra sign bit. The AR(ARP) is modified as 
specified to generate the magnitude of the exponent. It is assumed that 
AR(ARP) is initialized before the normalization begins. The default mod- 
ification of the AR(ARP) is an increment. 


Multiple executions of the NORM instruction may be required to completely 
normalize a 32-bit number in the accumulator. Although using NORM with 
RPT or RPTK does not cause execution of NORM to “fall out” of the repeat 
loop automatically when the normalization is complete, no operation is 
performed for the remainder of the repeat loop. Note that NORM functions 
on both positive and negative two’s-complement numbers. 


1 


NORM Normalize Contents of Accumulator NORM 


Cycles 
Cycle Timings for a Single Instruction 
~~ PI/DE PE/DI PE/DE PR/DI PR/DE 
‘20 me ei DE eee ce 
7 een Deas Ee (Te 
Cycle Timings for a Repeat Execution 
ON Mie Ja od sii, i ee 
‘capt oon fe te te Pe 
Example 1 31-Bit Normalization: 
LARP 1 Use AR1 for exponent sturage. 
LARK 1,0 ‘Clear out exponent counter. 
LOOP NORM e+ One bit is normalized. 
BBZ LOOP If TC = 0, magnitude not found yet. 
Example 2 15-Bit Normalization: 
LARP 1 Use AR1 to store the exponent. 
LARK 1,15 Initialize exponent counter. 
RPTK 14 15-bit normalization is specified 
(yielding a 4-bit exponent and 16-bit 
mantissa). 
NORM vs NORM automatically stops shifting 


when the first significant magnitude 
bit is found, performing NOPs for the 
remainder of the repeat loop. 


The first method is used to normalize a 32-bit number and yields a 5-bit 
exponent magnitude. The second method is used to normalize a 16-bit 
number and yields a 4-bit exponent magnitude. |f the number requires only 
a small amount of normalization, the first method may be preferable to the 
second. This results because Example 1 runs only until normalization is 
complete. Example 2 always executes all 15 cycles of the repeat loop. 
Specifically, Example 1 is more efficient if the number requires five or less 
shifts. If the number requires six or more shifts, Example 2 is more efficient. 


Note: 


The TMS32020 accepts only the NORM instruction (no operand). 
Source code compatibility of the TMS320C25 with the TMS32020 al- 
lows the NORM instruction to also be used without a specified oper- 
and. In that case, any comments on the same line as the instruction will 
be interpreted as the operand. If the first character is an asterisk (*), 
then the instruction will be assembled as NORM * with no auxiliary 
register modification taking place upon execution. The user is therefore 
advised to replace the NORM instructions with NORM *+ when the 
default modification of increment is desired. 


The resulting value in the auxiliary register will not be the real exponent 
of the number for all modification options. However, it can always be 
used to obtain the exponent. 
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OR OR with Accumulator OR 


Syntax 
Direct: [<label>] OR <dma> 
Indirect: [<label>] OR {ind}[,<next ARP>] 


Operands O<dma<127 | 
O < next ARP < 7 


Execution (PC) +1 7PC 
(ACC(15-0)) .OR.dma ~ ACC(15-0) 
(ACC(31-16)) > ACC(31-16) 


Not affected by SXM. 


Encoding 15 14 13 12 11109 8 7 65 43 21 ~0 
Indirect:| 0 1 0 0 141 1 +0 441 | See Section 4.1 


Description The low-order bits of the accumulator are ORed with the contents of the 
addressed data memory location. The high-order bits of the accumulator 
are ORed with all zeroes. Therefore, the upper half of the accumulator is 
unaffected by this instruction. 


Cycle Timings for a Single Instruction 


20[__» | _2ntnd [n+p Le 
cos [__n | t+ntnd | np [ttnendtp] a | toned | 
Example OR DATS8 (DP = 8) 
or 
OR * Where current auxiliary register contains 1032. 
Before Instruction After Instruction 
Data Data 
Memory >FO00 Memory >FO0O 
1032 1032 


C Cc 
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ORK 


Syntax 
Operands 


Execution 


Encoding 


OR Immediate with Accumulator with Shift ORK 


Description 


Words 
Cycles 


Example 


‘20 


’C25 


‘20 


‘C25 


[<label>] ORK <constant> [,<shift>] 


16-bit constant 

O < shift < 15 (defaults to 0) 

(PC) +27 PC 
(ACC(30-0)).OR. [constant x 2Shift} + ACC(30-0) 
(ACC(31)) 7 ACC(31) 


Not affected by SXM. 
15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 =0 


v0. 0 00h 04 
16-Bit Constant 


The left-shifted 16-bit immediate constant is ORed with the accumulator. 
The result is left in the accumulator. Low-order bits below and high-order 
bits above the shifted value are treated as zeroes. The corresponding bits 
of the accumulator are unaffected. Note that the most-significant bit of the 
accumulator is not affected, regardless of the shift code value. 


2 


PR/DI 
2 eee | ee eee 
[__rotrepeatabie Cd 


ORK >FFFF,8 


Before Instruction After Instruction 


Cc » iG 
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OUT Output Data to Port OUT 
Syntax 


Direct: [<label>] OUT <dma>,<PA> 
indirect: [<label>] OUT {ind},<PA>[,<next ARP>] 


Operands 0 < dma s 127 
0 < next ARP < 7 
O < port address PA < 15 


Execution (PO) es > Pe 
Port address PA > address bus A3-AO 
0 ~ address bus A15-A4 
(dma) ~ data bus D15-D0 


Encoding 15 14 13 12 11 10 9 


Direct: Ean eee Port Address Data Memory Address 
Indirect:| 11 1  O]|  PortAddress | 1 | See Section 4.1 


‘Description The OUT instruction writes a 16-bit value from a data memory location to 
the specified |/O port. The IS line goes low to indicate an I/O access, and 
the STRB, R/W, and READY timings are the same as for an external data 
memory write. OUT is a single-cycle instruction when in the PI/DI memory 
configuration (see Appendix E). 


Words 1 


Cycles 
pitt | 2tdti | 2tpti_ | 3tdtpti | - - 
 caeat—[aaetent | anepeni fanendepen = [>] 


+ni 


Example OUT >78,7 (DP = 4) Output data word stored in data 
memory location >78 to peripheral on 
port address 7. 
OUT * OF Output data word referenced by current 
auxiliary register to peripheral on port 
address >F.. 


‘20 
‘C25 


‘20 
‘C25 
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PAC Load Accumulator with P Register PAC 


Syntax [<label>] PAC 
Operands None 


Execution (PC) +'1-> PC 
(shifted P register) ~ ACC 


Affected by PM. 


Encoding 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 =0 
1 1 0 60 1 1 1 0 O oO 90 1 0 1 0 0 


Description The contents of the P register are loaded into the accumulator, shifted as 
specified by the PM status bits. 


Words 1 
Cycles 


Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/D! PE/DE PR/DI PR/DE 


{1 [1 | ts |} wp | - | - 
2 a HE SOE A 
ye a A ECT A OT OY A 
cos ne [| np Pomp [| fn 


Example PAC (PM = 0) 


Before Instruction After Instruction 


Po 
acc =] Ace 
Cc Cc 
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POP Pop Top of Stack to Low Accumulator POP 


Syntax {<label>] POP 
Operands None 


Execution (PC) +17 PC 
(TOS) > ACC(15-0) 
0 ~ ACC(31-16) 
Pop stack one level. 


Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 1 =#«0 
1 1 0O oO 1 1 14 0 0 0 0 41 1 1 Oo 1 


Description The contents of the top of the stack (TOS) are copied to the low accu- 
mulator, and the stack popped after the contents are copjed. The upper half 
of the accumulator is set to all zeros. 


The hardware stack is a last-in, first-out stack with four.(TMS32020) or 
eight (TMS320C25) locations. Any time a pop occurs, every stack value 
is copied to the next higher stack location, and the top value is removed 
from the stack. After a pop, the bottom two stack words will have the same 
value. Because each stack value is copied, if more than three/seven pops 
(due to POP, POPD, or RET instructions) occur before any pushes occur, 
all levels of the stack contain the same value. No provision exists to check 
stack underflow. 


Cycle Timings for a Single Instruction 
Pi/DI 


Cycle Timings for a Repeat Execution 
20 [29 _| 
cos [__n 


Example POP 
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POP Pop Top of Stack to Low Accumulator POP 


Before Instruction 


Cc 


Stack 
(2C) 


Stack 
(C25) 


After Instruction 
acc |x| [ >45 | 
oo 


Stack >16 
(20) >7 
>33 
>33 
Stack 


(C25) 
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POPD Pop Top of Stack to Data Memory POPD 


Syntax 
Direct: [<label>] POPD <dma> 
Indirect: [<label>] POPD {ind}[{,<next ARP>] 


Operands O < dma s 127 
O < next ARP < 7 


Execution (PC) 1 = PC 
(TOS) ~*~ dma 
POP stack one level. 


Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 1 +0 


Direct:} O 1 1 1 1 Oo 1 0 Data Memory Address 
Indirect:| 0 1 1 1 1 0 1 0f71 | See Section 4.1 


Description The value from the top of the stack is transferred into the data memory lo- 
cation specified by the instruction. The values are also popped in the lower 
three (TMS32020) or seven locations (TMS320C25) of the stack. The 
hardware stack is described in the previous instruction POP. The lowest 
stack location remains unaffected. No provision exists to check stack un- 
derflow. 


Words 1 
Cycles 


[2 | 2d | 2p | 2edep | - [| - 
ee 


‘20 
‘C25 


‘20 
‘C25 


Example POPD DAT100 (DP = 8) 


or 
POPD * If current auxiliary register contains 1124, 
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— - 


POPD Pop Top of Stack to Data Memor POPD 


Before Instruction After Instruction 


Memory >55 Memory >92 
1124 1124 
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PSHD Push Data Memory Value onto Stack PSHD 


Syntax 
Direct: [<label>] PSHD <dma> 
Indirect: [<label>] PSHD {ind}[,<next ARP>] 


Operands 0 < dma s 127 
0 < next ARP < 7 


Execution (dma) ~ TOS 
(PC) +17 PC 
Push all stack locations down one level. 


Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 1 #90 


Direct:| O 1 0 1 0 1 0 0 Data Memory Address 
Indirect:} 0 1 0 1 +0 1 +0 Of1 | See Section 4.1 


Description The value from the data memory location specified by the instruction is 
transferred to the top of the stack. The values are also pushed down in the 
lower three (TMS32020) or seven locations (TMS320C25) of the stack, 
as described in the instruction PUSH. The lowest stack location is lost. 


Words 1 


Cycles 
ao] 2 | +d tp | atdep | 
cos[__1 | 2d | +p | 2tdep | 1 | 2rd | 
20 [an | 2ntnd | 2ntp | 2ntndep | - [| - _| 
cas [__n | t#ntnd | ntp | t+nendtp| mn | t+nend_| 


Example PSHD DAT127 (DP = 3) 


or 
PSHD 7 If current auxiliary register contains 511. 
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PSHD Push Data Memory Value onto Stack PSHD 


Before Instruction After Instruction 
Data Data 
Memory >65 Memory . >65 

511 511 

Stack 

(20) 

Stack 

(C25) 
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PUSH 


Syntax 
Operands 


Execution 


Encoding 


Description 


Words 
Cycles 
‘20 
’C25 
‘20 
"C25 
Example 
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ad oe agin ada eee 


Push Low Accumulator onto Stack PUSH 


[<label>] PUSH 
None 


(PC) +1 7 PC 
Push all stack locations down one level. 
(ACC(15-0)) > TOS 


15 14 13 12 1110 9 8® 7 6 &5& 4 3 2 1 +90 
1 41 0 0 4 4 4 0 O08 O O 4 #4 4 OO O 


The contents of the lower half of the accumulator are copied onto the top 
of the hardware stack. The stack is pushed down before the accumulator 
value is copied. 


The hardware stack is a last-in, first-out stack with four (TMS32020) or 
eight locations (TMS320C25). If more than four/eight pushes (due to 
CALA, CALL, PSHD, PUSH, or TRAP instructions) occur before a pop, the 
first data values written will be lost with each succeeding push. 


1 


Cycle Timings for a Single Instruction 
a ae Ee ae es ee ee 
a a a ee a ee a ee ee 


Cycle Timings for a Repeat Execution 


p2n | an S| rtp | ante | OC 
ae ae ee ee ee eee eee 
PUSH 
Before Instruction After Instruction 
Cc Cc 
Stack >2 Stack 
(20) >5 (20) 
>3 
>0 
Stack Stack 
(C25) (C25) 
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RC Reset Carry Bit (TMS320C25) RC 


Syntax [<label>] RC 
Operands None 


Execution (PC) + 7-2 PC 
0 > carry bit C in status register ST1 


Affects C. 


Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 1 490 
1 1 Oo #60 1 1 1 0 oO oO 1 1 0 0 0 0 


Description The carry bit C in status register ST1 is reset to logic zero. The carry bit 
may also be loaded directly by the LST1 and SC instructions. 


Words 1 


Cycles 
Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/DI PE/DE PR/DI PR/DE 
Cycle Timings for a Repeat Execution 
Example RC The carry bit C is reset to logic zero. 
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RET Return from Subroutine RET 


Syntax [<label>] RET 
Operands None 


Execution (TOS) ~ PC 
Pop stack one level. 


Encoding 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 #90 
1 1 0 0 1 1 1 0 0 0 1 0 0 1 1 #O 


Description The contents of the top stack register are copied into the program counter. 
The stack is then popped one level. RET is used with CALA and CALL for 
subroutines. 


Cycles 
Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/DI PE/DE PR/DI PR/DE 
rot 2 | 2 | 2» | 2» | - | - | 
'C25 Destination on-chip RAM: 
2 2 2+p 
Destination on-chip ROM: 
3 3+p 
Destination external memory: 
3+p 3+p 3+2p 
Cycle Timings for a Repeat Execution 
20[-notrepeatble CdS dC 
’C25 not repeatable 
Example RET 
Before Instruction After Instruction 
PC >96 PC >37 
Stack >37 '  $tack >45 
(20) >45 (20) >75 
>75 >21 
>21 >21 
Stack Stack 
(C25) (C25) 
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Reset Serial Port Frame 


RFSM Synchronization Mode (TMS320C25) RFSM 


Syntax 
Operands 


Execution 


Encoding 


Description 


Words 
Cycles 
"C25 
‘C25 
Example 


[<label>] RFSM 
None 


(PC) 1: = PC 
0 > FSM status bit in status register ST1 


Affects FSM. 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 =O 
1 1 0 0 1 1 1 0 0 0 1 1 0 1 1 0 


The RFSM status bit resets the FSM status bit to logic zero. In this mode, 
external FSR pulses are not required to initiate the receive operation for 
each word received, but rather only one FSR pulse is required to initiate a 
“continuous mode” of operation. The same holds true for FSX when TXM 
= 0. After the first FSR/FSX pulse, these inputs are then in a “don’t care” 
state. If TXM = 1, FSX is pulsed the first time DXR is loaded, but remains 
low thereafter. See Section 3.9 for further details on the operation of the 
serial port. FSM may also be loaded by the LST1 and SFSM instructions. 


1 


+t oe | te Pt Ot 
[on [nm mp | np | on 


RFSM FSM is reset, putting the serial port 
in a mode of operation where frame 
synchronization pulses are not required. 
This allows a continuous bit stream to 
be transmitted/received without FSX/FSR 
pulses every 8/16 bits. 
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RHM 


Syntax 
Operands 


Execution 


Encoding 


Description 


Words 
Cycles 


cas[1 [1 | 1+p | tp | 1 | 1 
ro A A OO 


Example 
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Reset Hold Mode (TMS320C25 RHM 


[<label>] RHM 
None 


(PC) +17 PC 
0 ~ HM status bit in status register ST1 


Affects HM. 


15 14 13 12 1110 9 8 7 6 5 4 3 2 1 #0 
i1 1 0 0 4 4 41 O0O O O 1 4 4 8 O ODO 


The RHM instruction resets the HM status bit to logic zero. In this mode, 
the TMS320C2x is not halted during the assertion of HOLD when executing 
from on-chip program memory (either RAM or ROM), but instead places 
its external buses in the high-impedance state and continues execution 
until an external access must be made. External access can mean (in addi- 
tion to the normal connotation) the following conditions: 


MP/MC CNF PC 
0 0 PC 4096 
0 1 4096 < PC < 65279 
1 0 Any PC value (normal 
TMS32020-type hold mode) 
1 1 PC < 65279 


HM can also be loaded by the LST1 and SHM instructions. 
1 


Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/DI PE/DE PR/DI PR/DE 


Cycle Timings for a Repeat Execution 


RHM HM is reset, implementing the TMS320C25 
hold mode for on-chip program execution. 


ROL Rotate Accumulator Left (TMS320C25 ROL 


Syntax [<label>] ROL 
Operands None 


Execution (PC) +17 PC 
(ACC(31)) 7 C 
(ACC(30-0)) ~ ACC(31-1) 
(C, before ROL) ~ ACC(0) 
Affects C. 
Not affected by SXM. 


Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 1 #0 
1 1 0 0 4 #4 tt O dO O 14 4 0 14 OO O 


Description The ROL instruction rotates the accumulator left one bit. The MSB is 
shifted into the carry bit, and the value of the carry bit from before the exe- 
cution of the instruction is shifted into the LSB. 


Cycle Timings for a Single Instruction 


cof 1 [1 {| wp | te | 1 | 1 
coset» [| =e | oe | =» ] -» 


Example ROL 


Before Instruction After Instruction 


 & C 
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ice aan oe anal een ete re! a eg TS, 


ROR Rotate Accumulator Right (TMS320C25 ROR 


Syntax [<label>] ROR 
Operands None 


Execution (PO) 1 PC 
(ACC(0)) 7 C 
(ACC(31-1)) ~*~ ACC(30-0) 
(C, before ROR) ~ ACC(31) 


Affects C. 
Not affected by SXM. 


Encoding 15 14 13 12 11 10 9 8 7 6 5 4 3 2 14 =0 
1 1 0 0 1 1 1 0 0 0 1 1 0 1 Oo 1 


Description The ROR instruction rotates the accumulator right one bit. The LSB is 
shifted into the carry bit, and the value of the carry bit from before the exe- 
cution of the instruction is shifted into the MSB. 


Words 1 


Cycles 

CIB io A pele le Wee fe in 
| Cycle Timings fora Repeat Execution 

cosa.» | op | np [| 1 |» | 


Example ROR 


Before Instruction After Instruction 


ACC [oj >B0001234 ACC [o] >5800091A 
C c 
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ROVM 


Syntax 
Operands 


Execution 


Encoding 


Description 


Example 


Reset Overflow Mode ROVM 


[<label>] ROVM 
None 


(PC) +17 PC 
0 > OVM status bit in status register STO 


Affects OVM. 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 #0 
1 #1 0 0 1 14 1 0 0 0 0 0 0 0 1 «0 


The OVM status bit is reset to logic zero, which disables the overflow mode. 
If an overflow occurs with OVM reset, the OV (overflow flag) is set, and the 
overflowed result is placed in the accumulator. 


OVM may also be loaded by the LST and SOVM instructions. 
1 


ROVM The overflow mode bit OVM is reset, 
disabling the overflow mode on any 
subsequent arithmetic operations. 
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Repeat Instruction as 


RPT Specified by Data Memory Value RPT 


Syntax 


Direct: 
Indirect: 


Operand 


s 


Execution 


Encoding 


Indirect: 


Descript 


Words 
Cycles 


Example 
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Direct: 


ion 


'20 
'C25 


‘20 
"C25 


[<label>] RPT <dma> 
[<label>] RPT {ind}{,<next ARP>] 


O < dma < 127 
O < next ARP < 7 


(PC) +17PC 
(dma(7-0)) ~ RPTC 


1514 1312 11109 8 7 6 5 43 210 
1 


0 1 0 0 1 0 1 Data Memory Address . 
ee a ee ae See Section 4.1 


The eight LSBs of the addressed data memory value are loaded into the 
repeat counter (RPTC). This causes the following instruction to be exe- 
cuted one time more than the number loaded into the RPTC (provided that 
it is a repeatable instruction). Interrupts are masked out until the next in- 
struction has been executed the specified number of times. (Interrupts 
cannot be allowed during the RPT/next instruction sequence, because the 
RPTC cannot be saved during a context switch.) The RPTC counter is 
cleared on a RS. 


RPT and RPTK are especially useful for repeating instructions, such as 
BLKP, BLKD, IN, MAC, MACD, NORM, OUT, TBLR, TBLW, and others. 


1 


[1 | 2a | tsp [| aeaep | - | - 
P| ara tp | aedep | 1 | ara 
[not repeatable 


RPT DAT127 (DP = 31) 

SFR 

or 

RPT 2 If current auxiliary register contains 4095. 
SFR 


Repeat Instruction as 


RPT Specified by Data Memory Value RPT 
Before Instruction After Instruction 
Data Data 
4095 4095 


ace fi) [Simmase7=] © ace (o] [saan] 
Cc c 
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Repeat Instruction as 


RPTK Specified by Immediate Value RPTK 
Syntax [<label>] RPTK <constant> 
Operands O < constant < 255 
Execution (PC) +t - PC 
Constant ~ RPTC 
Encoding 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 9 


Description 


Words 
Cycles 
20 
‘C25 
‘20 
'C25 
Example 
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The 8-bit immediate value is loaded into the RPTC (repeat counter). This 
causes the following instruction to be executed one time more than the 
number loaded into the RPTC (provided that it is a repeatable instruction). 
Interrupts are masked out until the next instruction has been executed the 
specified number of times. (Interrupts cannot be allowed during the 
RPT/next instruction sequence because the RPTC cannot be saved during 
a context switch.) The RPTC is cleared on a RS. 


RPT and RPTK are especially useful for repeating instructions, such as 
BLKP, BLKD, IN, MAC, MACD, NORM, OUT, TBLR, TBLW, and others. 


1 


Cycle Timings for a Single Instruction 


ee (SE GT Ta ES RE 
lineata 


LRLK AR2,>200 Load AR2 with the address of X. 
LARP 2 

ZAC Clear the accumulator. 

MPYK O Clear the P register. 

RPTK 2 Repeat next instruction 3 times. 
SQRA E+ Compute X**2 + Y**2 + Z**2., 

APAC 


RSXM Reset Sign-Extension Mode RSXM 


Syntax [<label>] RSXM 
Operands -_ None 


Execution (PC) 1: PC 
0 ~ SXM sign-extension mode status bit 


Affects SXM. 


Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 4 =O 
1 1 0 0 1 1 1 0 0 0 0 0 0 1 1 O 


Description The RSXM instruction resets the SXM status bit to logic zero, which sup- 
presses sign-extension on shifted data memory values for the following 
arithmetic instructions: ADD, ADDT, ADLK, LAC, LACT, LALK, SBLK, SUB, 
and SUBT. 


The RSXM instruction affects the definition of the SFR instruction. SXM 
may also be loaded by the LST1 and SSXM instructions. 


Words 1 


Cycles 

a Re eee Se ee ee ee ee ee 
RE a (ee (Se ee a ee ee 
[a [| ne ] mp | - 
ae ee Ra er ae ee er eee (eee ee 


Example RSXM SXM is reset, disabling sign-extension on 
subsequent instructions. 


‘20 
‘C25 


‘20 
'C25 
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RTC Reset Test/Control Flag (TMS320C25) RTC 


Syntax [<label>] RTC 
Operands None 


Execution (PC) +1 = PC 
- 0Q-~ TC test/control flag in status register ST1 


Affects TC. 


Encoding 19° 14-18- 12) AT 40-9. 8" 7. GG. Be B22 DD 
1 1 0 60 1 1 1 0 oO O 1 1 0 0 1 O 


Description The TC (test/control) flag in status register ST1 is reset to logic zero. TC 
may also be loaded by the LST1 and STC instructions. 


Words 1 


PRIDE 
or | en 
co 


Example “RTC TC (test/control) flag is reset to logic zero. 


Cycles 
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RTXM 


Syntax 
Operands 


Execution 


Encoding 


Description 


Words 
Cycles 


Example 


‘20 
’C25 


‘20 
‘C25 


Reset Serial Port Transmit Mode | RTXM 


[<label>] RTXM 
None 


(PC). +1. PC 
O ~ TXM transmit mode status bit 


Affects TXM mode bit. 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 14 =0 
1 1 0 0 1 1 1 0 0 0 1 0 0 0 0 60 


The RTXM instruction resets the TXM status bit, which configures the serial 
port transmit section in a mode where it is controlled by an FSX (external 
framing pulse). The transmit operation is started when an external FSX 
pulse is applied. TXM may also be loaded by the LST1 and STXM in- 
structions. 


1 


eat ee ee ee ee ee 
Et oe es 
a I eT 

ote we ane 


RTXM TXM is reset, configuring FSX as an input. 
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RXF Reset External Flag RXF 


Syntax [<label>] RXF 
Operands None 


Execution (PC) +17 PC 
O > XF external flag pin and status bit 


Affects XF. 


Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 14 #90 
1 1 0 0 1 4 4 0 0 0 0 0 4 +4 #0 0 


Description The XF pin and XF status bit in status register ST1 are reset to logic zero. 
XF may also be loaded by the LST1 and SXF instructions. 


Words 1 


Cycles 
Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/DI PE/DE PR/DI PR/D 
20 PAS safe See Ihe ee ie ED ells 2. ee le te 
'C25 
1: ee es (Oe Rs es 
‘coa{_n fn 
Example RXF XF pin and status bit are reset to logic zero. 
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SACH Store High Accumulator with Shift SACH 


Syntax 
Direct: [<label>] SACH <dma>{,<shift>] 
Indirect: [<label>] SACH {ind}[,<shift>[,<next ARP>]] 


Operands 0 < dma < 127 
0 < next ARP <7 
O < shift < 0, 1, or 4 (defaults to 0) on the TMS32020 
O < shift < 7 (defaults to 0) on the TMS320C25 


Execution (PG) #1 PC 
16 MSBs of (ACC) x 28hift > dma 


Not affected by SXM. 
Encoding 15) 14s 4S 12 9 70) 29 BS Oe OB Be MB ee A 


Direct:| 0 1 1 0 14 — Shift | Data Memory Address 
Indirect} 0 1 1 0 14 Shift | 1) See Section 4.1 


Description The SACH instruction copies the entire accumulator into a shifter, where it 
shifts the entire 32-bit number 0, 1, or 4 bits on the TMS32020, or any- 
where from 0 to 7 bits on the TMS320C25.It then copies the upper 16 bits 
of the shifted value into data memory. The accumulator itself remains un- 
affected. 


Words 1 


Cycles 

Pt | ad | tte | Sede | OT 
P41 | +d | tp | 2ap | 1 | tra | 
Pn | 2ntnd [ate | Snendtp | | 
pn tnd | inte ttntnd+p | on] ntnd | 


Example SACH DAT10,4 (DP = 4) 
or 
SACH *,4 If current auxiliary register contains 522. 


‘20 
* 'C25 


‘20 
‘C25 


Before Instruction After Instruction 


ACC >4208001 ACC >4208001 
Cc Cc 
Data Data 
Memory Memory >4208 


522 522 
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SACL 


Syntax 


Store Low Accumulator with Shift SACL 


Direct: [<label>] SACL <dma>[,<shift>] 
Indirect:. [<label>] SACL {ind}[,<shift>[,<next ARP>]] 


Operands 


Execution 


Encoding 


Direct: 


0 < dma < 127 

0 < next ARP < 7 

O < shift < 0,1, or 4 (defaults to 0) on the TMS32020 
0 < shift < 7 (defaults to 0) on the TMS320C25 


(PC) +1 7PC 
16 LSBs of (ACC) x 2shift + dma 


Not affected by SXM. 
15 14 13 12 11 10 9 8 7 6 5 


A> <3 2. Ae’ 0 
pot to of shit Data Memory Address 


Indirect:|o 1 1 0 Of Shift | 1] _ See Section 4.1 


Description 


Words 
Cycles 


Example 


4-144 


‘20 
"C25 


‘20 
"C25 


The low-order bits of the accumulator are shifted left 0, 1, or 4 bits on the 
TMS32020 or anywhere from 0 to 7 bits on the TMS320C25, as specified 
by the shift code, and stored in data memory. The low-order bits are filled 
with zeros, and the high-order bits are lost. The accumulator itself is unaf- 
fected. 


1 


PI/DE 
Se ss 
po ted tte atop Pt aed 


Pn _2ntnd [n+p | anendtp | - [> 

pn | ning | ntp | t+nrndtp] n | nen _| 

SACL DAT11,1 (DP = 4) 

or : 

SACL #*,1 If current auxiliary register contains 523. 
Before Instruction After Instruction 


ACC >7C638421 ACC >7C638421 
Cc Cc 
Data Data 


523 523 


SAR Store Auxiliary Register SAR 


Syntax 


Direct: [<label>] SAR <AR>,<dma> 
Indirect: [<label>] SAR <AR>,{ind}[,<next ARP>] 


Operands 


Execution 


Encoding 


O < dma < 127 
O < auxiliary register AR < 7 
O< next ARP <7 


(PG) ae Pe 
(AR) ~ dma 


15 14 13 12 100 9 8 7 6 5 4 3 2 1 +0 


11 
Direct:| 0 1 1 1 0 Data Memory Address 
1 1 1 0 


R 
indirect: [ 0 See Section a 


Description 


Words. 
Cycles 


Example 1 


‘20 
‘C25 


‘20 
‘C25 


The contents of the designated auxiliary register (AR) are stored in the ad- 
dressed data memory location. ; 


When modifying the contents of the current auxiliary register in the indirect 
addressing mode, SAR ARn (when n = ARP) stores the value of the auxil- 
iary register contents before it is incremented, decremented, or indexed by 
ARO. 


1 


PI/DE 
ae ae ae a ee ee 
ee 

pon | antnd | inte | 3n+nd+p | 
pon | ntnd inte | t+ntnd+p | 


SAR ARO,DAT30 (DP = 6) 
or 


SAR ARO,* If current auxiliary register contains 798. 
Before Instruction After Instruction 
ARO >37 ARO >37 
Data Data 
Memory >18 Memory >37 
798 798 
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SAR Store Auxiliary Register SAR 


Example 2 LARP ARO 
SAR ARO, *0+ 


ARO ARO 

Data Data 

Memory Memory 
025 


1025 
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Subtract from Accumulator 


SBLK Long Immediate with Shift SBLK 
Syntax [<labeli>] SBLK <constant>[,<shift>] 
Operands 16-bit constant 


O < shift < 15 (defaults to 0) 


Execution (PC) + 2-2 PC ; 
(ACC) - [constant x 2Shift} + ACC 


If SXM = 1: 
Then -32768 < constant < 32767. 
lf SXM = 0: 


Then O < constant < 65535. 


Affects OV; affected by OVM and SXM. 
Affects C (TMS320C25). 


Encoding 15 14 13 12 11 10 9 8 7 6 5 4 3:2 14 O 


110 1] Sh [0 0 0 0 0 0 11 
16-Bit Constant 


Description The immediate field of the instruction is subtracted from the accumulator. 
The result replaces the accumulator contents. SXM determines whether the 
constant is treated as a signed two’s-complement number or as an un- 
signed number. The shift count is optional and defaults to zero. 


Words 2 


Cycles 
SS 
PSC repeatable CCC” 


Example SBLK 5712 


‘20 
"C25 


‘20 
'C25 


Before Instruction After Instruction 


C Cc 
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Subtract from Auxiliary Register 


SBRK Short Immediate (TMS320C25) SBRK 
Syntax [<label>] SBRK <constant> 
Operands O < constant < 255 


Execution (PC) + 1 -*-PE 
AR(ARP) - 8-bit positive constant ~ AR(ARP) 


Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 1 #0 


fot tt tt tt] «BBR Constant id 


Description The 8-bit immediate value is subtracted, right-justified, from the currently 
selected auxiliary register with the result replacing the auxiliary register 
contents. The subtraction takes place in the ARAU, with the immediate 
value treated as an 8-bit positive integer. 


Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/DI PE/DE PR/DI PR/DE 


Cycle Timings for a Repeat Execution 
‘C25 


Example SBRK >FF (ARP = 7) 


Before Instruction After Instruction 


AR7 AR? > FFON 
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SC 


Syntax 
Operands 


Execution 


Encoding 


Description 


Words 
Cycles 


Set Carry Bit (TMS320C25 SC 
[<label>] SC 
None 
(PC) +17 PC 
1 > carry bit C in status register ST1 
Affects C. 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 +0 
1 1 0 0 14 4 14 0 0 0 4 4 O09 0 O 1 


The carry bit C in status register ST1 is set to logic one. The carry bit may 
also be loaded directly by the LST1 and RC instructions. 


1 


Example 


sc Carry bit C is set to logic one. 
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SEL 


Syntax 
Operands 


Execution 


Encoding 


Shift Accumulator Left SFL 


[<label>] SFL 
None 


TMS32020: 


(PC) +17 PC 
(ACC(30-0)) > ACC(31-1) 
0 > ACC(0) 


Not affected by SXM bit. 


_ TMS320C25: 


Description 


Words 
Cycles 


Example 
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‘20 
"C25 


‘20 
"C25 


(PC) 4t--PC 

(ACC(31)) 7 C 
(ACC(30-0)) > ACC(31-1) 
0 ~ ACC(0) 

Affects C. 

Not affected by SXM bit. 


15 14 13 12 1110 9 8 7 6 5 4 3 2 14 +0 
1 1 0 0 1 1 1 0 0 0 0 1 1 0 0 60 


The SFL instruction shifts the entire accumulator left one bit. The least- 
significant bit is filled with a zero. On the TMS32020, the most-significant 
bit is lost. On the TMS320C25, the most-significant bit is shifted into the 
carry bit (C). Note that SFL, unlike SFR, is unaffected by SXM. 


1 


Se GC OD 
Et 4 ee ee oe 
a OS 
SS OS 


SFL 


Before Instruction ae coe Instruction 


Cc 


SFR Shift Accumulator Right SFR 


Syntax | 
Operands 


Execution 


Encoding 


Description 


Words 
Cycles 


af 1 [1 [|e | tp [| - | - | 
os 1+ | 1 | tp | ts» ] 1 | 7 1 
zo[ nm ~ . | op [| mp | - | - | 
asf» | | neo | nep_ | nd 


[<label>] SFR 
None 
T™TMS32020: 


lf SXM = 0: 
Then (ACC(31-1)) ~ ACC (30-0) and 0 ~ ACC(31). 

if SXM = 1: 
Then (ACC(31-1)) > ACC(30-0) and (ACC(31)) ~ ACC(31). 


Affected by SXM bit. 


TMS320C25: 
(PC) +17 PC 
If SXM = 0: 
Then (ACC(0)) *C 
(ACC(31-1)) ~ ACC (30-0) and 0 ~ ACC(31). 
If SXM = 1: 
Then (ACC(0)) 7 C 
(ACC(31-1)) ~ ACC(30-0) and (ACC(31)) ~ ACC(31). 
Affects C. 
Affected by SXM bit. 


15 14 13 12 1110 9 8 7 6 5 4 3 2 1 QO 
1 1 0 0 14 4 4 0 0 0 0 4 #4 0 0 1 


The SFR instruction shifts the accumulator right one bit. If SXM = 1, the 
instruction produces an arithmetic right shift. The sign bit (MSB) is un- 
changed and is also copied into bit 30. Bit 0 is shifted into the carry bit (C). 


if SXM = 0, the instruction produces a logical right shift. All of the accu- 
mulator bits are shifted by one bit to the right. The least-significant bit is 
shifted into the carry bit, and the most-significant bit is filled with a zero. 


On the TMS32020, note that bit 0 is lost. 
1 


Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/DI PE/DE PR/DI PR/DE 


Cycle Timings for a Repeat Execution 
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SER Shift Accumulator Right SER 


Example 1 SFR (SXM = 0) 


Before Instruction 


Cc 


Example 2 SFR (SXM = 1) 


Cc 
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After Instruction 


ACC [o| >5800091A 
C 


ACC (o| >D800091A 
Cc . 


Set Serial Port Frame 


SFSM Synchronization Mode (TMS320C25 SFSM 
Syntax [<label>] SFSM 
Operands None 


Execution (PC > PC 
1 ~ FSM status bit in status register ST1 


Affects FSM. 


Encoding 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 +0 
1 1 0 oOo 1 1 1 0 oO oO 1 1 Oo. 61 1 1 


Description The SFSM instruction sets the FSM status bit to logic one. In this mode, 
an external FSR pulse is required for a receive operation, and an external 
FSX pulse is required if TXM = 0. If TXM = 1, FSX pulses are generated in 
the normal manner every time the transmit shift register XSR is loaded. See 
Section 3.7 for details on the operation of the serial port. FSM may also be 
loaded by the LST1 and RFSM instructions. 


Words ai 


as aes ee ee ee ee ee 
Pe te te 


Example SFSM FSM is set, putting the serial port ina 
mode of operation where frame synchronization 
pulses are required for each word to be 
transmitted or received. 


Cycles 


‘C25 


"C25 


4-153 


SH Set Hold Mode (TMS320C25) SHM 


Syntax .[<label>] SHM 
Operands None 


Execution (PG) -2°-41.5> PC 
1 > HM status bit in status register ST1 


Affects HM. 


Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 1 #90 
1 1 0) 0 1 1 1 0 0 0 1 1 1 0 0 1 


Description The SHM instruction sets the HM status bit to logic one. In this mode, the 
TMS320C25 is halted in the normal manner whenever HOLD is asserted, 
regardless of the PC value or the state of the MP/MC pin. HM may also be 
loaded by the LST1 and RHM instructions. 


na i ee ee 


Example SHM HM is set, implementing the normal 
(TMS32020-type) hold mode of operation. 
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SOVM 


Syntax 
Operands 


Execution 


Encoding 


Description 


Words 
Cycles 


Example 


‘20 
‘C25 


‘20 
’C25 


Set Overflow Mode SOVM 


[<label>] SOVM 
None 


(PC) +17 PC 
1 > overflow mode (OVM) status bit 


Affects OVM.. 


15 14 13 12 1110 9 8 7F 6 5 4 3 2 1 #90 
1 1 0 0 1 1 1 0 0 0 0 0 0 0 1 1 


The OVM status bit is set to logic one, which enables the overflow (satu- 
ration) mode. If an overflow occurs with OVM set, the overflow flag OV is 
set, and the accumulator is set to the largest representable 32-bit positive 
(>7FFFFFFF) or negative (>80000000) number according to the direction 
of overflow. 


OVM may also be loaded by the LST and ROVM instructions. 
1 


p41 1 | tp | i | - | - | 
a ee ee ee 
[a [nm] np | np | - 4 

[nf | _ntp | —nep [a | 9 


SOVM The overflow mode bit OVM is set, enabling 
the overflow mode on any subsequent 
arithmetic operations. 
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SPAC Subtract P Register from Accumulator SPAC 


Syntax [<label>] SPAC 
Operands None 
Execution (PC) + 1.> PC 


(ACC) - (shifted P register) ~ ACC 


Affects OV; affected by PM and OVM. 
Affects C (TMS320C25). 
Not affected by SXM. 


Encoding 15 14 13 12 1110-9 8 7 6 5 4 3 2 1 =«0 
1 1 0 0 1 1 1 0 0 0 0 1 0 1 1 #O 


Description The contents of the P register, shifted as defined by the PM status bits, are 
subtracted from the contents of the accumulator. The result is stored in the 
accumulator. Note that SPAC is unaffected by SXM; the P register is always 
sign-extended. 


The SPAC instruction is a subset of LTS, MPYS, and SORS. 
Words 1 


Cycles 
Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/DI PE/DE PR/DI PR/DE 
Example -SPAC (PM = 0) 


Before Instruction After Instruction 


ace hI >ac] acc [>a 


Cc Cc 
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SPH Store High P Register (TMS320C25 SPH 


Syntax 
. Direct: [<label>] SPH <dma> 
Indirect: [<label>] SPH {ind}[,<next ARP>] 


Operands O < dma < 127 
O < next ARP < 7 


Execution (PC) +17 PC 
(PR shifter output (31-16)) ~ dma 


Affected by PM. 
Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 1 =O 


Direct:| O 1 1 1 1 1 0 1 Data Memory Address 
Indirect:|0 1 1 #14 #1 #1 +0 1414] See Section 4.1 


Description The high-order bits of the P register, shifted as specified by the PM bits, 
are stored in data memory. Neither the P register nor the accumulator are 
affected by this instruction. High-order bits are sign-extended when the 
right-shift by 6 mode is selected. Low-order bits are taken from the low P 
register when left-shifts are selected. 


Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/DI PE/DE PR/DI PR/DE 


Example SPH DAT3 (DP = 4, PM = 2) 
or 
SPH 7 If current auxiliary register contains 515. 
Before Instruction After Instruction 
P | >FEO79844 P >FE0O79844 
Data Data 
Memory Memory >E079 
515 515 
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SPL Store Low P Register (TMS320C25 SPL 


Syntax 
Direct: [<label>] SPL <dma> 
Indirect: [<label>] SPL {ind}[,<next ARP>] 


Operands 0 < dma s 127 
O < next ARP < 7 


Execution (PC) + 42>7.PC 
(PR shifter output (15-0)) > dma 


Affected by PM. 
Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 1 90 
0 


Direct:} 0 1 1 1 1 1 0 Data Memory Address 
Indirect:|} 0 4 17 #1 1 =+14 ~0 oOf1 | See Section 4.1 


Description The low-order bits of the P register, shifted as specified by the PM bits, are 
stored in data memory. Neither the P register nor the accumulator are af- 
fected by this instruction. High-order bits are taken from the high P register 
when the right-shift by 6 mode is selected. Low-order bits are zero-filled 
when left-shifts are selected. 


Words 1 
Cycles 


Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/DI PE/DE PR/DI PR/DE 


1025 
Cycle Timings for a Repeat Execution 
"C25 
Example SPL DAT3 (DP =-4,. PM = 2) 
or 
SPL 7 If current auxiliary register contains 515. 


Before Instruction After Instruction 


P > FEO79844 | P >FEO79844 
Data Data 
ee >4567 Nene >8440 
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SPM Set P Register Output Shift Mode SPM 


Syntax [<label>] SPM <constant> 
Operands O < constant < 3 
Execution (PC) + 1° PC 
Constant ~ product register shift mode (PM) status bits 
Affects PM. 
Encoding 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 +90 


1 3 1°0 0 1 4 +4 0 0 0 0 0 1 Of PM | 


Description The two low-order bits of the instruction word are copied into the PM field 
of status register ST1. The PM status bits control the P register output 
shifter. This shifter has the ability to shift the P register output either one 
or four bits to the left or six bits to the right, or to perform no shift. The bit 
combinations and their meanings are shown below. 


PM ACTION 

00 No shift of multiplier output 

01 Output left-shifted 1 place and zero-filled 

10 Output left-shifted 4 places and zero-filled 

11 Output right-shifted 6 places, sign-extended; LSB bits lost. 


The left-shifts allow the product to be justified for fractional arithmetic. The 
right-shift by six bits has been incorporated to implement up to 128 multi- 
ply-accumulate processes without the possibility of overflow occurring. PM 
may also be loaded by an LST1 instruction. 


Words 4 


Cycles 

ke i ee ee 
SS CT 
PF SC~S*~Ctnotrepeatabie dC CCC CS” 


Example SPM 3 Product register shift mode 3 is selected, 
causing all subsequent transfers from the 
product register to the ALU to be shifted 
to the right six places. 


‘20 
‘C25 


‘20 
'C25 
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SQRA Square and Accumulate Previous Product SQRA 


Syntax 
Direct: [<label>] SQRA <dma> 
Indirect: [<label>] SQRA {ind}[,<next ARP>] 


Operands 0 < dma s 127 
O < next ARP <7 


Execution (PC) +17 PC 
(ACC) + (shifted P register) > ACC 
(dma) ~ T register 
(dma) x (dma) > P register 


Affects OV; affected by PM and OVM. 
Affects C (TMS320C25). 


_ Encoding 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 #90 


Direct:| 0 O 1 1 1 0 oOo 1 Data Memory Address 
Indirect} 0 0 1 #14 #14 #0 0 141 | See Section 4.1 


Description The contents of the P register, shifted as defined by the PM status bits, are 
added to the accumulator. The addressed data memory value is then loaded 
into the T register, squared, and stored in the P register. 


Words 1 
Cycles 


| 
PR/D 
zof_1 | 2a | tp | 2-ep | - | - 
cas{__41 | 2a | +p | 2+dep | 1 | 2a 
20[7 a [arena [np | anenaep | - [> 
can | t+nend | nep | 1+nend+p| n | tenend 


Example SQRA DAT30 (DP = 6, PM = 0) 
or 
SQRA * If current auxiliary register contains 796&. 
Before Instruction After Instruction 
Data Data 
798 798 
T T 
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SQRS Square and Subtract Previous Product SORS 


Syntax 
Direct: [<label>] SQRS <dma> 
indirect: [<label>] SQRS {ind}[,<next ARP>] 


Operands 0 < dma < 127 
O < next ARP < 7 


Execution (PC) +17 PC 
(ACC) - (shifted P register) ~ ACC 
(dma) > T register 
(dma) x (dma) ~ P register 


Affects OV; affected by PM and OVM. 
Affects C (TMS320C25). 


Encoding 15 14 13 12 1110 9 8 7 6 5 4.3 2 1 =#0 


Direct:|0 1 0 7 #4 0 4 0] 0 [- Data Memory Address 
Indirect} 0 1 #0 1 #1 +0 1 + #«Of1 | See Section 4.1 


Description The contents of the P register, shifted as defined by the PM status bits, are 
subtracted from the accumulator. The addressed data memory value is then 
loaded into the T register, squared, and stored into the P register. 


Words 1 


Cycles 
Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/DI PE/DE PR/DI PR/DE . 
20 fone eid pe 2tdipe 
‘czp{ 1 | td | tte | edt | ted 
Cycle Timings for a Repeat Execution 
20{__n | anend | ntp | 2nendep | - | 
cos [___n | t+nend | ntp [t+nend+p] nm | 1ontnd | 
Example SQRS DATS (DP = 6, PM = 0) 
or 
SORS 7 If current auxiliary register contains 777. 
Before Instruction After Instruction 
Data Data 
Memory > Memory 
777 777 
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SST Store Status Register STO SST 


Syntax 


Direct: [<label>] SST <dma> 
Indirect: [<label>] SST {ind}{,<next ARP>] 


Operands O < dma < 127 


O < next ARP < 7 


Execution (PC}):+ 1°: PC 


Encodi 


(status register STO) ~ dma 
ng 15 14 13 12 1110 9 8 7 6 5 4 3 2 1 =0 


Direct:| 0 1 1 1 1 0 oO O Lo | Data Memory Address 


Indirect] 0 1 #1 #1 #1 #0 0 Of1 | See Section 4.1 


Description Status register STO is stored in data memory. 
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In the direct addressing mode, status register STO is always stored in page 
O regardless of the value of the DP register. The processor automatically 
forces the page to be 0, and the specific location within that page is defined 
in the instruction. Note that the DP register is not physically modified. This 
allows storage of the DP register in the data memory on interrupts, etc., in 
the direct addressing mode without having to change the DP. In the indi- 
rect addressing mode, the data memory address is obtained. from the auxil- 
iary register selected. (See the LST instruction for more information.) 


The SST instruction can be used to store status register STO after interrupts 
and subroutine calls. The STO contains the status bits: OV (overflow flag) 
bit, OVM (overflow mode) bit, INTM (interrupt mode) bit, ARP (auxiliary 
register pointer) bit, and DP (data memory page pointer) bit. The status 
bits are stored in the data memory word as follows: 


15 13. 12 1 8 7 6 5 4 3 2 1 0 


14 1 10 9 
A 


Note that SST * may be used to store status register STO anywhere in data 
memory, while SST in the direct addressing mode is forced to page 0. 


Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/DI PE/DE PR/DI PR/DE 


C28 
20} nf antnd inte | antndtp | = 
25] in| tnd nt ttntndtp |] on | ntnd 


SST Store Status Register STO SST 


Example SST DAT96 (DP = don't care) 
or 
SST * If current auxiliary register contains 96. 
Before Instruction After Instruction 


Status 


Status 
Realter >A408 Register >A408 
STO STO 


Data Data 


Memory Memory >A408 
96 96 
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SST1 Store Status Register ST1 SST1 


Syntax 
Direct: [<label>] SST1 <dma> 
Indirect: [<label>] SST1 {ind}[,<next ARP>] 


Operands 0 < dma < 127 
O < next ARP < 7 


Execution (PC) +1 PC 
(status register ST1) > dma 


Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 1 #90 


Direct:{O_ 1 1 1 1 0 0 14) Data Memory Address 
a nee on oe ee See Section 4.1 


Description Status register ST1 is stored in data memory. In the direct addressing mode, 
status register ST1 is always stored in page 0 regardless of the value of the 
DP register. The processor automatically forces the page to be 0, and the 
specific location within that page is defined in the instruction. Note that 
the DP register is not physically modified. This allows the storage of the 
DP in the data memory on interrupts, etc., in the direct addressing mode 
without having to change the DP. In the indirect addressing mode, the data 
memory address is obtained from the auxiliary register selected. (See the 
LST1 instruction for more information.) 


Indirect: 


SST1 is used to store status bits after interrupts and subroutine calls. ST1 
contains the status bits: ARB (auxiliary register pointer buffer), CNF (RAM 
configuration control) bit, TC (test/control) bit, SXM (sign-extension 
mode) bit, XF (external flag) bit, FO (serial port format) bit, TXM (transmit 
mode) bit, and the PM (product register shift mode) bit. ST1 on the 
TMS320C25 also contains the status bits: C (carry) bit, HM (hold mode) 
bit, and FSM (frame synchronization mode) bit. The bits loaded into status 
register ST1 from the data memory word are as follows: 


15 14 13 12 1 


1 10 9 8 7 6 5 4 3 2 1 0 
Te [sxm] ct | 11 jet | eset} xe | FO] TxM] PM | 


tOn the TMS32020, bits 5, 6, and 9 are one’s. 


Note that SST1 * may be used to store status register ST1 anywhere in data 
memory, while SST1 in the direct addressing mode is forced to page 0. 


[1 [2a | i+p | seaep | - | 

cos{_1 | tea | +p | 2edep | 1 | 4a] 
20{_a | 2nend | ntp | anendep | - | 

c25 [__n | ntnd | _nep | t+nend+p] | 
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SST1 Store Status Register ST1 SST1 


Example SST1 DAT97 (DP = don't care) 
SST1 * If current auxiliary register contains 97. 
Before Instruction After Instruction 
Status Status 
negate >A7EO Rega >A7E0O 
1 T1 
Data Data 


Memory Memory >A7EO 
97 97 
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SSXM Set Sign-Extension Mode SSXM 


Syntax [<label>] SSXM 
Operands None 


Execution (PC) + 74> PC 
1 ~ SXM status bit in status register ST1 


Affects SXM. 


Encoding 15:14, 13-12-44 100 SB. Fe G6 UB A OB A? oD 
1 14 0 0 4 4 4 0.0 0 0 0 0 +4 +4 =~ 1 


Description The SSXM instruction sets the SXM status bit to logic 1, which enables 
sign-extension on shifted data memory values for the following arithmetic 
instructions: ADD, ADDT, ADLK, LAC, LACT, LALK, SBLK, SUB, and. 
SUBT. . 


SSXM also affects the definition of the SFR instruction. SXM may also be 
loaded by the LST1 and RSXM instructions. 


‘Words 


— 


Cycles 


+ + [tt [ mw | - ~— - 1 
Pe ee ee 
[—. [ | mp | np | - ] - 
[nf nf op [nee Pn 


Example SSXM SXM is set, enabling sign extension on 
subsequent instructions. 


‘20 
‘C25 


‘20 
’C25 
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STC 


‘Syntax 
Operands 


Execution 


Encoding 


Set Test/Control Flag (TMS320C25 STC 


[<label>] SSXM 
None 


(PC) + 17> PC 
1 > TC test/control flag in status register ST1 


Affects TC. 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 =#=90 
1 1 o oO 1 1 1 0 oO oO 1 1 oO oO 1 1 


Description The TC (test/control) flag in status register ST1 is set to logic one. TC may 


Words 
Cycles 


Example 


also be loaded by the LST1 and RTC instructions. 
1 


Po ee ep ea 
eer 


STC TC (test/control) flag is set to logic one. 


4-167 


STXM 


Syntax 
Operands 


Execution 


Encoding 


Description 


Words 
Cycles 


Example 
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‘20 
‘C25 


‘20 
‘C25 


Set Serial Port Transmit Mode STXM 


[<label>] STXM 
None 


(PC) +17 PC 
1 ~ TXM status bit in status register ST 


Affects TXM. 


15 14 13 12 1110 9 8 7 6 5 4 3 2 1 90 
1 $1 0 0 14 141 4 0 0 0 14 0 0 0 0 147 


The STXM instruction sets the TXM status bit to logic 1, which configures 
the serial port transmit section to a mode where the FSX pin behaves as an 
output. A pulse is produced on the FSX pin each time the DXR register is 
loaded internally. The transmission is initiated by the negative edge of this 
pulse. TXM may also be loaded by the LST1 and RTXM instructions. If the 
FSM status bit is a logic zero and serial port operation has already started, 
the FSX pin will be driven low if TXM = 1. 


1 


Pf 4 ee | 
[+f 1 |» [|p | 1 | 7 | 
OS 
[nf tp fp | dP 


STXM TXM is set, configuring FSX as an output. 


SUB 


Syntax 


Direct: 
Indirect: 


Operands 


Execution 


Encoding 


Direct: 


Ww 


Subtract from Accumulator with Shift SU 


[<label>] SUB <dma>[,<shift>] 
[<label>] SUB {ind}{,<shift>[,<next ARP>]] 


0 < dma < 127 
O < next ARP < 7 
O < shift < 15 (defaults to 0) 


(PC) +1 > PC . 

(ACC) - [(dma) x 28hift] + Acc 
lf SXM = 1: 

Then (dma) is sign-extended. 


If SXM = 0: 
Then (dma) is not sign-extended. 


Affects OV; affected by OVM and SXM. 
Affects C (TMS320C25). 


15 14 13 12 1110 9 8 7 6 5 


4 3 2 1 =O 
poo Oo tf shit Data Memory Address 


Indirect:| o_o o 1{ shit || See Section 4.1 


Description 


Words 
Cycles 


Example 


"20 
‘C25 


‘20 
"C25 


The contents of the addressed data memory location are left-shifted and 
subtracted from the accumulator. During shifting, low-order bits are zero- 
filled. High-order bits are sign-extended if SXM = 1 and zero-filled if SXM 
= 0. The result is stored in the accumulator. 


1 


Pipl 
[4 | aed | tp | 2edp | - | - 
[41 | 2a | tp | 2eaep | 1 
[| 2nend [nt | anéndtp | 
Tn | tenend | nep | tenenatp| a 


SUB DAT80 (DP = 8) 


or 
SUB * If current auxiliary register contains 1104. 
Before Instruction After Instruction 
Data Data 
Memory >11 Memory >11 
1104 1104 
ACC >24 ACC >13 
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Subtract from Accumulator 


SUBB with Borrow (TMS320C25) SUBB 


Syntax 


Direct: [<label>] SUBB <dma> 


Indirect: [<label>] SUBB {ind}[,<next ARP>] 


Operands 


0 < dma < 127 


~O < next ARP <7 


Execution 


Encoding 


Indirect: 


Descript 


Words 
Cycles 


Example 


4-170 


Direct: 


ion 


‘C25 


'C25 


(PC) +1 >PC 
(ACC) - (dma) - (€) > ACC 


Affects C and OV; affected by OVM. 
15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 #90 


0 1 10) 0 1 1 ; | Data Memory Address 
ee ee ee a ee ee See Section 4.1 


The contents of the addressed data memory location and the value of the 
carry bit are subtracted from the accumulator. The carry bit is then affected 
in the normal manner (see Section 3.5.2). 


1 


P11 2d [ 1+» | 2+dep [1 [24d 
[nL tenend | n+p [tenendtp] mn | tener 


SUBB DATS (DP = 8) 


or 
SUBB * If current auxiliary register contains 1029. 
Before Instruction After Instruction 
Data Data 
1029 1029 


acc |o] acc [o] [_>FFFFFFFF 
c C 


In the above example, C is originany zeroed, presumably from the result of 
a previous subtract instruction that performed a borrow. The effective op- 
eration performed was 6 - 6 - (0) = -1, generating another borrow (and 
resetting carry again) in the process. 


The SUBB instruction can be used in performing multiple-precision arith- 
metic. 


SUBC 


Syntax 


Conditional Subtract SUBC 


Direct: [<tabel>] SUBC <dma> 
Indirect: [<label>] SUBC {ind}[,<next ARP>] 


Operands 


Execution 


Encoding 


Direct: 


Indirect: 


Description 


Words 
Cycles 


‘20 
‘C25 


‘20 
"C25 


O < dma < 127 
O < next ARP < 7 


(PC) +1 >PC 
(ACC) - [(dma) x 215] > ALU output 
lf ALU output > 0: 


Then (ALU output) x 2 + 1 7 ACC; 
Else (ACC) x 2 > ACC. 


Affects OV. 
Affects C (TMS320C25). 
Not affected by OVM (no saturation) or SXM. 


15 14 13 12 11 10 9 8 7 6 5 4 3 2° 1 =#90 


Oo tle 0° Oe. 0 te ne Ad Well Data Memory Address 


1 See Section 4.1 


The SUBC instruction performs conditional subtraction, which may be used 
for division. The 16-bit dividend is placed in the low accumulator, and the 
high accumulator is zeroed. The divisor is in data memory. SUBC is exe- 
cuted 16 times for 16-bit division. After completion of the last SUBC, the 
quotient of the division is in the lower-order 16-bit field of the accumulator, 
and the remainder is in the high-order 16 bits of the accumulator. SUBC 
assumes the divisor and the dividend are both positive. 


If the 16-bit dividend contains less than 16 significant bits, the dividend 
may be placed in the accumulator left-shifted by the number of leading 
non-significant zeroes. The number of executions of SUBC is reduced from 
16 by that number. One leading zero is always significant. 


Note that SUBC affects OV but is not affected by OVM, and therefore the . 
accumulator does not saturate upon positive or negative overflows when 
executing this instruction. 


1 


Pt aed | top [| aeaep | - 
Pf aed ftp aap | 
[a] 2nend | ntp | 2nendvp |---| 
Pn [ tenend | _nep[tenendep| nm | 1ennd_| 
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SUBC 


Example 
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Conditional Subtract SUBC 
15 
DAT2 (DP = 4) 
15 
® If current auxiliary register contains 514. 
Before Instruction Atter Instruction 


Data 


Data 
Cc Cc 


SUBH Subtract from High Accumulator SUBH 


Syntax 


Direct: [<label>] SUBH <dma> 
Indirect: [<label>] SUBH {ind}[,<next ARP>] 


Operand 


Ss 


Execution 


Encoding 


Direct:} 0 1 0 oO O i 0 Data Memory Address 
Indirect:|0 1 © 0 0 1. O En See Section 4.1 


Descript 


Words 
Cycles 


Example 


ion 


‘20 
‘C25 


‘20 
‘C25 


0 < dma s 127 
O < next ARP s 7 


(PC) +1—7PC 
(ACC) - [(dma) x 218] > ACC 
Affects OV; affected by OVM. 
Affects C (TMS320C25). 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 #9 
ce) 


| 


The contents of the addressed data memory location are subtracted from the 
upper 16 bits of the accumulator. The 16 low-order bits of the accumulator 
are unaffected. The result is stored in the accumulator. The carry bit C on 
the TMS320C25 is reset if the result of the subtraction generates a borrow; 
otherwise, C is unaffected. 


The SUBH instruction can be used for performing 32-bit arithmetic. 
1 


PRIDE 
[1 | 2a | 1 | 2p | - | - 
= 


[a | 2nend [ep | anendep | - -+[ 
Pn [ tentna | ntp | tenendep| nn | tenend 
SUBH DAT33 (DP = 6) 
or 
SUBH bi If current auxiliary register contains 801. 
Before Instruction After Instruction 
Data Data 
801 801 


G Cc 
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Subtract from Accumulator 


SUBK Short Immediate (TMS320C25) SUBK 
Syntax [<label>] SUBK <constant> 
Operands O < constant < 255 


Execution (PC) +17 PC 
(ACC) - 8-bit positive constant ~ ACC 


Affects C and OV: affected by OVM. 
Not affected by SXM. 


Encoding 15 14 13 12.11 10 9 8 7 6 5 4 3. 2 1 0 
1 1 0 0 1 1 0 1 8-Bit Constant. 


Description The 8-bit immediate value is subtracted, right-justified, from the. accu- 
mulator with the result replacing the accumulator contents. The immediate 
value is treated as an 8-bit positive number, regardless of the value of SXM. 


Words 1 
Cycles 


oA rs 


Example SUBK >12 


‘C25 


Before Instruction After Instruction 


Cc C 
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Subtract from Low Accumulator 


SUBS with Sign-Extension Suppressed SUBS 
Syntax 


Direct: [<label>] SUBS <dma> 
Indirect: [<label>] SUBS {ind}[,<next ARP>] 


Operand 


Ss 


Execution 


Encoding 


Indirect: 


Descript 


Words 
Cycles 


Example 


Direct: 


ion 


‘20 
‘C25 


‘20 
‘C25 


O < dma s 127 

O < next ARP < 7 

(PC) +17 PC 

(ACC) - (dma) > ACC 
Affects OV; affected by OVM. 


Affects C (TMS320C25). 
Not affected by SXM. 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 90 
41 


0 1 0 (¢) 0 1 0 Data Memory Address 
1 ee ee ee eee See Section 4.1 


The contents of the addressed data memory location are subtracted from the 
accumulator with sign-extension suppressed. The data is treated as a 16- 
bit unsigned number, regardless of SXM. The accumulator behaves as a 
signed number. SUBS produces the same result as a SUB instruction with 
SXM = 0 and a shift count of 0. 


1 


[tase | tp | aedep | - 
Se 
[| anena [n+p | 2néndep | --[- 
rom ama eae! 


SUBS DAT2 (DP = 16) 
or 
SUBS 7 If current auxiliary register contains 2050. 
Before Instruction , After Instruction 
Data Data 
Memory >FO03 Memory >F003 
2050 2050 
Before Instruction After Instruction 


C G 
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Subtract from Accumulator 


SUBT with Shift Specified by T Register SUBT 
Syntax 


Direct: [<label>] SUBT <dma> 
Indirect: [<label>] SUBT {ind}[,<next ARP>] 


Operand 


s 


Execution 


Encoding 


Descript 


Words 
Cycles 


Example 
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Direct: 


ion 


'20° 


‘C25 


‘20 
‘C25 


O<dma < 127 
O < next ARP < 7 


(PC) +17 PC ; 
(ACC) - [(dma) x oT register(3-0)} > (ACC) 
lf SXM = 1: . 
Then (dma) is sign-extended. 
If SXM = 0: 
Then (dma) is not sign-extended. 


Affects OV; affected by SXM and OVM. 
Affects C (TMS320C25). 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 #=90 
0 


Data Memory Address 


ii "0; 0h 0. oA 
Indirect} 0 1 0 0 0 1 1 a) See Section 4.1 


| 


The data memory value is left-shifted and subtracted from the accumulator. 
The left-shift is defined by the four LSBs of the T register, resulting in shift 
options from 0 to 75 bits. The result replaces the accumulator contents. 
Sign-extension on the data memory value is controlled by the SXM status 
bit. 


1 


ee ee ee 
pon | antnd | ntp | antndep | - T- 
ae pele 2) 


Tentndp 
SUBT° DAT127 (DP = 4) 
or 
SUBT * If current auxiliary register contains 639. 


Subtract from Accumulator 


SUBT with Shift Specified by T Register SUBT 
Before Instruction After Instruction 
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SXF Set External Flag SXF 


Syntax [<label>] SXF 
Operands None 


Execution (PC) +17 PC 
1 > external flag (XF) pin and status bit 


Affects XF. 


Encoding 156 14 13 12 11 10 9 8 7 6 5 4 3 2 #14 #90 
1 1 0 oO 1 1 1 0 0 0 0 0 1 1 Oo 1 


Description The XF pin and the XF status bit in status register ST1 are set to Jogic 1. 
XF may also be loaded by the LST1 and RXF instructions. 


Words | 


Cycles 
_ Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/DI PE/DE PR/DI PR/DE 
Example SXF The XF pin and status bit are set to logic 1. 
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TBLR 


Syntax 
Direct: 
Indirect: 


Operands 


Execution 


Encoding 


Table Read TBLR 


[<label>] TBLR <dma> 
[<label>] TBLR {ind}[,<next ARP>] 


O < dma < 127 
O < next ARP < 7 


TMS32020: 


(PC) +1 > TOS 
(ACC(15-0)) > PC 


If (repeat counter) # 0: 
Then (pma) ~ dma, 
Modify AR(ARP) and ARP as specified, 
(PC) +1 PC, 
(repeat counter) - 1 > repeat counter. 


Else (pma) ~ dma 
Modify AR(ARP) and ARP as specified. 
(TOS) 7 PC 


TMS320C25: 


(PC) +17 PC 
(PFC) > MCS 
(ACC(15-0)) > PFC 


If (repeat counter) # 0: 

Then (pma, addressed by PFC) ~ dma, 
Modify AR(ARP) and ARP as specified, 
(PFC) + 1 > PFC, 

(repeat counter) - 1 ~ repeat counter. 


Else (pma, addressed by PFC) ~ dma 
Modify AR(ARP) and ARP as specified. 
(MCS) > PFC 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 #0 


Direct:}| O 1 0 1 1 0 Data Memory Address 


Indirect: 


Description 


Words 


Of 68 4. 4 6. 0 0/1 | See Section 4.1 


The TBLR instruction transfers a word from a location in program memory 
to a data memory location specified by the instruction. The program mem- 
ory address is defined by the low-order 16 bits of the accumulator. For this 
operation, a read from program memory is performed, followed by a write 
to data memory. When in the repeat mode, TBLR effectively becomes a 
single-cycle instruction, and the program counter that contains the ACCL 
is incremented once each cycle. On the TMS32020, the contents of the 
lowest stack location are lost when using the TBLR instruction. 


lf the MP/MC pin on the TMS320C25 is low at the time of execution of this 
instruction and the program memory address used is less than 4096, an 
on-chip ROM location will be read. 


1 
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TBLR Table Read TBLR 


Cycles 
Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/DI PE/DE PR/DI PR/DE 
‘20 | Table in internal program memory: 
3 3+d 3+p 3+d+p 
Table in external program memory: 
3+p 4+d+p 3+2p 4+d+2p 
‘C25 | Table in on-chip RAM: 
2 2+d 3+p 3+d+p 
Table in on-chip ROM: 
3 3+d 4+p 4t+d+p 
Table in external memory: 
3+p 3+d+p 4+2p 4+d+2p 
‘20 | Table in internal program memory: 
2+n 2+n+nd 2+n+p 2+n+nd+p 
Table in external program memory: 
2+n+np 2+2n+nd 2+nt+np 2+2n+nd+np 
+np +p +p 
‘C25 | Table in on-chip RAM: 
1+n i+n+nd 2+n+p 2+n+nd+p 2+n+nd 
Table in on-chip ROM: 
2+n 2+n+nd 3+n+p 3+n+ndt+p 3+n 3+n+nd 
Table in external memory: 
2+n+np 1+2ntnd+np 3+ntnp 2+2ntndtnp 3+n+np 2+2n+nd 
+p +p +np 
Example TBLR DAT6 (DP = 4) 
TBLR * If current auxiliary register contains 518. 
Before Instruction After Instruction 
ACC >23 ACC >23 
Program Program 
Memory Memory 
23 23 
Data Data 
Memory >75 Memory 
518 518 
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TBLW 


Syntax 
Direct: 
Indirect: 


Operands 


Execution 


Encoding 


Direct: 


Indirect: 


Description 


Words 


Table Write TBLW 


[<label>] TBLW <dma> 
[<label>] TBLW {ind}[,<next ARP>] 


0 < dma < 127 
O < next ARP < 7 


TMS32020: 


(PC) +1 TOS 
(ACC(15-0)) + PC 


If (repeat counter) # 0: 
Then (dma) ~ pma, 
Modify AR(ARP) and ARP as specified, 
(PC) +17 PC, 
(repeat counter) - 1 ~ repeat counter. 


Else (dma) ~ pma 
Modify AR(ARP) and ARP as specified. 
(TOS) > PC 


TMS320C25: 


(PC) +17PC 
(PFC) > MCS 
(ACC(15-0)) > PFC 


lf (repeat counter) # 0: 

Then (dma, addressed by PFC) ~ pma, 
Modify AR(ARP) and ARP as specified, 
(PFC) + 1 ~ PFC, 

(repeat counter) - 1 ~ repeat counter. 


Else (dma, addressed by PFC) ~ pma 
Modify AR(ARP) and ARP as specified. 
(MCS) > PFC 


15 14 13 12 1110 9 8 7 6 5 4 3 2 1 =O 


0 1 0 1 1 1 Data Memory Address 


0 1 0 1 1 0 0 1]1 | See Section 4.1 


The TBLW instruction transfers a word in data memory to program memory. 
The data memory address is specified by the instruction, and the program 
memory address is specified by the lower 12 bits of the accumulator. A read 
from data memory is followed by a write to program memory to complete 
the instruction. When in the repeat mode, TBLW effectively becomes a 
single-cycle instruction, and the program counter that contains the ACCL 
is incremented once each cycle. On the TMS32020, the contents of the 
lowest stack location are lost when using the TBLW instruction. 


If the MP/MC pin on the TMS320C25 is low at the time of execution of this 
instruction and the program memory address used is less than 4096, an 
on-chip ROM location will be addressed but not written to. 


, = 
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TBLW Table Write TBLW 


Cycles 
Cycle Timings for a Single Instruction 
Pi/DI PI/DE PE/DI PE/DE PR/DI PR/DE 
‘20 | Table in internal program memory: 
3 3+d 3+p 3t+d+p 
Table in external program memory: : 
3+p 4+d+p 3+2p 4+d+2p 
"C25 Table in on-chip RAM: 
2 3+d 3+p 4+d+p 
Table in on-chip ROM: 
not applicable 
Table in external memory: 
2+p 3+d+p 3+2p 4+d+2p 
‘20 | Table in internal program memory: 
2+n 2+n+nd 2+n+p 2+n+ndt+p 
Table in external program memory: 
2+n+np 2+2n+nd 2+n+np 2+2n+nd+np 
+np +p +p 
‘C25 Table in on-chip RAM: 
itn 2+n+nd 2+n+p 3+n+ndt+p 
Table in on-chip ROM: , 
not applicable 
Table in external memory: 
T+n+np 1+2nt+nd+np 2+n+np 2+2n+ndt+np 2+n+np 2+2n+nd 
+p +p *+np 
Example TBLW DATS5 (DP = 32) 
TBLW * If current auxiliary register contains 4101. 
Before Instruction After Instruction 


ACC 7257 ACC 


Program 


Program 
Memory >306 Memory > 4339 


Data Data 
Memory >4339 Memory >4339 
4101 4101 


257 257 
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TRAP Software Interrupt TRAP 


Syntax [<label>] TRAP 

Operands None 

Execution (PC) + 1 ~ stack 
30 ~ PC 


Not affected by INTM; does not affect INTM. 


Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 1 =O 
1 1 0 0 1 1 1 0 0 0 0 1 1 1 1 #O 


Description The TRAP instruction is a software interrupt that transfers program control 
to program memory location 30 and pushes the program counter plus one 
onto the hardware stack. The instruction at location 30 may contain a 
branch instruction to transfer control to the TRAP routine. Putting the PC 
+ 1 onto the stack enables an RET instruction to pop the return PC (points 
to instruction after the TRAP) from the stack. 


Words 1 


Cycles 
Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/DI PE/DE PR/DI PR/DE 
2 


of 2  { 2) | te Tp 


‘C25 Destination on-chip RAM: 
2 2+p 2+p 2 2 
Destination on-chip ROM: 
3 3+p 3+p 3 3 
Destination external memory: 
3+p 3+p 3+2p 3+p 


Cycle Timings for a Repeat Execution 


20| notrepeatable EL 
’C25 not repeatable 


Example TRAP Control is passed to program memory location 
30. PC + 1 is pushed onto the stack. 
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XOR Exclusive-OR with Accumulator 


Syntax 
Direct: [<label>] XOR <dma> 
Indirect: [<label>] XOR {ind}[,<next ARP>] 


’ Operands O < dma < 127 
O < next ARP < 7 


Execution (PC) +17 PC 
(ACC(15-0)).XOR.dma ~ ACC(15-0) 
(ACC(31-16)) ~ ACC(31-16) 


Not affected by SXM. 
Encoding 15 14 13 12 11 10 9 


Direct: A RICE Data Memory Address 
Indirect|0 1 0 0 1 #14 +O Of1 | See Section 4.1 


Description The low half of the accumulator is exclusive-ORed with the contents of the 
addressed data memory location. The upper half of the accumulator is not 


affected by this instruction. 
Words 1 
Cycles 


'20 


A Se ee 
cop} 1 | ed tte To tdte Pt Td 
20} on | antnd [inte | 2ntnd+p | - | 
c25 {on | tentnd | ontp | itntnd+p| on | tintnd 


Example XOR DAT127 (DP = 511) 
or 
XOR * If current auxiliary register contains 65535. 
Before Instruction After Instruction 


Data Data 
Memory >FOFO memory | > FOFO | 
65535 65535 
ACC >12345678 ACC >1234A688 
c c 
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XORK XOR Immediate with Accumulator with Shift XORK 


Syntax 
Operands 


Execution 


Encoding 


Description 


Words 
Cycles 
‘20 
“C25 
‘20 
"C25 
Example 


[<label>] XORK <constant>[,<shift>] 


16-bit constani 
0 < shift < 15 (defaults to 0) 


(PC) +27 PC 
(ACC(30-0)).XOR.[constant x 2shift ] - ACC(30-0) 
(ACC(31)) > ACC(31) 


Not affected by SXM. 
15 14 13 12 11 10 9 8 FJ 6 5 4 3 2 1 =O 


oo 0 Oo oO 41 1 0 
16-Bit Constant 


The left-shifted 16-bit immediate constant is exclusive-ORed with the .ac- 
cumulator, leaving the result in the accumulator. Low-order bits below and 
high-order bits above the shifted. value are treated as zeroes, thus. not af- 
fecting the corresponding bits of the accumulator. Note that thé most-sig- 
nificant bit of the accumulator is-not affected, regardless of the shift code 
value. 


N 


[2 | 2 | 2 | 2» |.- | - 
a NE 
[not repeatable dT Sd 


XORK >FFFF,8 


Before Instruction After Instruction 


Cc Cc 
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ZAC 


Syntax 
Operands 


Execution 


Encoding 


Description 


Words 
Cycles 


Example 
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a aes Re ee a ee 
co ns Se 


Zero Accumulator ZAC 


[<label>] ZAC 
None 


(PC) +17 PC 
07> ACC 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 #0 
1 7? 0O O 1 0 14 0 0 0 0 0 0 0 0 0 


The contents of the accumulator are replaced with zero. The ZAC instruc- 
tion has been implemented as a special case of LACK. (ZAC assembles as 
LACK 0.) 


1 


Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/DI PE/DE PR/DI . PR/DE 


Cycle Timings for a Repeat Execution 
[SSCS not repeatable dS 
not repeatable 


ZAC 


Before Instruction After Instruction 


Cc C 


Zero Low Accumulator 


ZALH and Load High Accumulator ZALH 
Syntax 


Direct: [<label>] ZALH <dma> 
Indirect: [<label>] ZALH {ind}[,<next ARP>] 


Operands 0 < dma s 127 
O < next ARP < 7 


Execution (PO) Tt = PC 
0 ~ ACC(15-0) 
(dma) ~ ACC(31-16) 


Encoding 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 90 


Direct:| O 1 0 0 0 0 0 0 Data Memory Address 
Indirect:|0 1 0 0 0 0 oO of1 | See Section 4.1 


Description ZALH loads a data memory value into the high-order half of the accumula- 
tor. The low-order bits of the accumulator are zeroed. 


ZALH is useful for 32-bit arithmetic operations. 


Cycle Timings for a Single Instruction 
PI/DI PI/DE PE/DI PE/DE PP/DI PR/DE 


Cycle Timings for a Repeat Execution 
20"): 2m =| ned 2) py oh 2nd ep Si) 
25 [-_n | tentnd | ntp | t+ntndvp| | 1nend 


Example ZALH DAT3 (DP = 32) 
or 
ZALH * If current auxiliary register contains 4099. 


Before Instruction After Instruction 


Data Data 
Memory >3F01 Memory >3FO1 


4099 4099 


Cc Cc 
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Zero Low Accumulator, Load High 
ZALR Accumulator with Rounding (TMS320C25 ZALR 


Syntax 
Direct: [<label>] ZALR <dma> 
indirect: [<label>] ZALR {ind}[,<next ARP>] 


Operands 0 < dma s 127 
0 < next ARP < 7 


Execution (PC) +17 PC 
>8000 ~ ACC(15-0) 
(dma) ~ ACC(31-16) 


Encoding 15 14 13 12 1110 9 8 7 6 5 4 3 2 1 =O 


Direct:| O 1 1 1 1 Oo 1 1 Data Memory Address 
Indirect:} O01 1 1 1 =O 4 141 | See Section 4.1 


Description The ZALR instruction loads a data memory value into the high-order half 
of the accumulator with rounding the value by adding 1/2 LSB; i.e., the 15 
low bits (bits 0-14) of the accumulator are set to zero and bit 15 of the 
accumulator is set to one. 


ZALR is a derivative instruction from ZALH. 
Words 1 


Cycles . 
| 
cos[ 1 | 24 | ‘+p | 2+dep | 1 | 2+a | 


25 [a [ tenend [nt [tenendep] on | tenend 
Example ZALR DAT3 (DP = 32) 
or 
ZALR * If current auxiliary register contains 4099. 
Before Instruction After Instruction 
Data Data 
Memory >3F01 Memory >3F01 
4099 4099 


C C 
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Zero Accumulator, Load Low Accumulator 


ZALS with Sign-Extension Suppressed ZALS 
Syntax 


Direct: [<label>] ZALS <dma> 
Indirect: [<label>] ZALS {ind}[,<next ARP>] 


Operands 


Execution 


Encoding 


Direct: 


Indirect: 


Description 


Words 
Cycles 


Example 


‘20 
"C25 


‘20 


"C25 


0 <dma < 127 
O < next ARP <s 7 


(PC) +17 PC 
0 > ACC(31-16) 
(dma) > ACC(15-0) 


Not affected by SXM. 
15 14 13 12 1110 9 8 7 6 5 4 3 2 1 #0 
1 


0 1 0 (0) 0 0 0 Data Memory Address 


0 1 0 0 0 0 0 141] See Section 4.1 


The contents of the addressed data memory location are loaded into the 16 
low-order bits of the accumulator. The upper half of the accumulator is 
zeroed. The data is treated as a 16-bit unsigned number rather than a 
two’s-complement number. Therefore, there is no sign-extension with this 
instruction, regardless of the state of SXM. (ZALS behaves the same as a 
LAC instruction with no shift and SXM = 0.) 


ZALS is useful for 32-bit arithmetic operations. 
1 


Cycle Timings for a Single Instruction 


PI/DI PI/DE PE/DI PE/DE PR/DI 
2+d 


a ee 


Cycle Timings for a Repeat Execution 


14+n+nd 


ZALS DAT1 (DP = 6) 


or ; 
ZALS * If current auxiliary register contains 769. 
Before instruction After Instruction 
Data . Data 
Memory | >F7FF Memory >F7FF 
769 : 769 


acc |x| [_>7FF00033 | acc [x] | >FIFF | 
c c 
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5. Software Applications 


The TMS320C2x microprocessor/microcomputer design emphasizes overall 
speed, communication, and flexibility. Many instructions are tailored to digital 
signal processing tasks, providing single-cycle multiply/accumulates, adaptive 
filtering support, and many other features. General-purpose instructions sup- 
port floating-point, extended-precision, logical processing, and control appli- 
cations. 


This section provides explanations of how to use the various TMS320C2x 
processor and instruction set features along with assembly language coding 
examples. More information about specific applications can be found in the 
book, Digital Signal Processing Applications with the TMS320 Family. 


Major topics discussed in this section are listed below. 


C] Processor Initialization (Section 5.1 on page 5-2) 


e Program Control (Section 5.2 on page 5-7) 
Subroutines 
Software stack 
Timer operation 
Single-instruction loops 
Computed GOTOs 


° Interrupt Service Routines (Section 5.3 on page 5-16) 
Context switching 
Interrupt priority 


© Memory Management (Section 5.4 on page 5-23) 
Block moves 
Configuring on-chip RAM 
Using on-chip RAM for program execution 


© Fundamental Logical and Arithmetic Operations (Section 5.5 on page 
5-31) 
Status register effects 
Bit manipulation 


e Advanced Arithmetic Operations (Section 5.6 on page 5-34) 
Overflow management 
Scaling 
Moving data 
Multiplication 
Division 
Floating-point arithmetic 
Indexed addressing 
Extended-precision arithmetic 


@ Application-Oriented Operations (Section 5.7 on page 5-60) 
Companding 
Filtering 
Fast Fourier Transforms (FFT) 
PID control. 
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Software Applications - Processor Initialization 


5.1 Processor Initialization 


5-2 


Prior to the execution of a digital signal processing algorithm, it is necessary 
to initialize the processor. Generally, initialization takes place anytime the 
processor is reset. 


When reset is activated by applying a low level to the RS (reset) input for at 
least three cycles, the TMS320C2x terminates execution and forces the pro- 
gram counter (PC) to zero. Program memory location 0 normally contains a 
B (branch) instruction in order to direct program execution to the system in- 
itialization routine. The hardware reset also initializes various registers and 
status bits. 


After reset, the processor should be initialized to meet the requirements of the 
system. Instructions should be executed that set up operational modes, me- 
mory pointers, interrupts, and the remaining functions necessary to meet sys- 
tem requirements. 


To configure the processor after reset, the following internal functions should 
be initialized: 


@ Memory-mapped registers 


S Interrupt structure 
e Mode control (OVM, SXM, FO, TXM, PM; plus HM and FSM on 
TMS320C25) | 


e Memory control (CNF) 
e Auxiliary registers and the auxiliary register pointer (ARP) 
8 Data memory page pointer (DP). 


The OVM (overflow mode), TC (test/control flag), and IMR (interrupt mask 
register) bits are not initialized by reset. The auxiliary register pointer (ARP), 
auxiliary register pointer buffer (ARB), and data memory page pointer (DP) 
are also not initialized by reset. 


Example 5-1 and Example 5-2 show coding for initializing the TMS32020 and 
TMS320C25, respectively, to the following machine state, in addition to Me 
initialization performed during the hardware reset: 


e All interrupts enabled 

e Overflow mode (OVM) disabled 

8 Data memory page pointer (DP) set to zero 

@ Auxiliary register pointer (ARP) set to four (TMS32020) or seven 
(TMS320C25) 

@ Internal memory filled with zero. 
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Example 5-1. Processor Initialization (TMS32020) 


TITL 'PROCESSOR INITIALIZATION' 
IDT ‘EXAMPLE! 

DEF RESET,INTO,INT1,INT2 

DEF TINT,RINT,XINT,USER 

REF ISRO,ISR1,ISR2 

REF TIME,RCV,XMT,PROC 


PROCESSOR INITIALIZATION FOR THE TMS32020. 
RESET AND INTERRUPT VECTOR SPECIFICATION. 
* BRANCHES FOR EXTERNAL AND INTERNAL INTERRUPTS. 


* % 


AORG >0000 


RESET B INIT  j; RS- BEGINS PROCESSING HERE. 

* 

INTO B ISRO ; INTO- BEGINS PROCESSING HERE. 
INT1l B ISR1 ; INT1- BEGINS PROCESSING HERE. 
INT2 B ISR2 ; INT2- BEGINS PROCESSING HERE. 
* 

AORG >0018 
TINT B TIME TIMER INTERRUPT PROCESSING. 


; 
RINT B RCV ; SERIAL PORT RECEIVE PROCESSING. 
XINT B XMT ; SERIAL PORT TRANSMIT PROCESSING. 
* 


USER B PROC ; TRAP VECTOR PROCESSING BEGINS. 

* 

THE BRANCH INSTRUCTION AT PROGRAM MEMORY LOCATION 0 DIRECTS 
EXECUTION TO BEGIN HERE FOR RESET PROCESSING THAT INITIAL- 
IZES THE PROCESSOR. WHEN RESET IS APPLIED, THE FOLLOWING 
CONDITIONS ARE ESTABLISHED FOR THE STATUS AND OTHER 
INTERNAL REGISTERS: 


ARP OV OVM 1 INTM DP 
STO: XXX 0 x 1 1 XXXXXXXXX 
ARB CNF TC SXM 11111 XF FO TXM PM 
STl: XXX 0) xX x 112111 1 0 0 XX 
REGISTER ADDRESS DATA 
DRR >0000 XXXX XXXX XXXX XXXX 
DXR >0001 XXXX XXXX XXXX XXXX 
TIM >0002 1111 1111 1111 1111 
PRD >0003 XXXX XXXX XXXX XXXX 
IMR >0004 1111 1211 LIXX XXXX 
GREG >0005 1111 1111 0000 0000 


" RESERVED XINT RINT TINT INT2 INTL INTO 
IMR: 1111111111 x x xX x xX x 


+ ee eH He HH HH HF HF HF HF HF He HF HF HF HH KF 


INIT ROVM DISABLE OVERFLOW MODE. 


‘ 
LDPK 0 ; POINT DP REGISTER TO DATA PAGE 0O. 
LARP 4 ; POINT TO AUXILIARY REGISTER 4. 
LACK >3F ; LOAD ACCUMULATOR WITH >3F. 
SACL 4 ; ENABLE ALL INTERRUPTS “VIA IMR. 
LALK >FFFF ; LOAD ACCUMULATOR WITH OFFFT. 
SACL 3 » INITIALIZE PERIOD REGISTER. 
SSXM ; SET SIGN-EXTENSION MODE TO 1. 
SPM O 7 


SET PM BITS TO O. 
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+ + 


+ + eH 


INTERNAL DATA MEMORY INITIALIZATION. 


ZAC 

LARK 
RPTK 
SACL 


LRLK 
RPTK 
SACL 


LRLK 
RPTK 
SACL 


AR4,>60 


+ 


AR4,>300 
255 


R4+ 


. 
a 
° 
, 


we 


’ 


‘ 


ZERO THE ACCUMULATOR. 
POINT TO BLOCK B2. 


STORE ZERC IN ALL 32 LOCATIONS. 
POINT TO BLOCK BO. 

ZERO ALL OF PAGES 4 AND 5. 
POINT TO BLOCK Bl. 


ZERO ALL OF PAGES 6 AND 7. 


THE PROCESSOR IS INITIALIZED. THE REMAINING APPLICATION- 
' DEPENDENT PART OF THE SYSTEM (BOTH ON- AND OFF-CHIP) SHOULD 


NOW 3E INITIALIZED. 


EINT 


, 


ENABLE ALL INTERRUPTS. 
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Example 5-2. Processor Initialization (TMS320C25) 


TITL 
IDT 
DEF 
DEF 
REF 
REF 


* PROCESSOR 
* RESET AND 


"PROCESSOR INITIALIZATION' 
"EXAMPLE! 
RESET, INTO, INT1, INT2 
TINT, RINT, XINT,USER 

ISRO, ISR1,ISR2 

TIME ,RCV,XMT,PROC 


INITIALIZATION FOR THE TMS320C25. 
INTERRUPT VECTOR SPECIFICATION. 


* BRANCHES FOR EXTERNAL AND INTERNAL INTERRUPTS. 


* 


AORG 
RESET B 
* 
INTO 
INTL 
INT2 
* 


TINT B 
RINT B 
XINT B 
* 


USER 
* 


tw 


THE BRANCH 
EXECUTION 
IZES THE P 
CONDITIONS 
INTERNAL R 


>0000 

INIT ; RS- BEGINS PROCESSING HERE. 

ISRO ; INTO- BEGINS PROCESSING HERE. 
ISR1L ; INT1- BEGINS PROCESSING HERE, 
ISR2 ; INT2- BEGINS PROCESSING HERE. 
>0018 

TIME ; TIMER INTERRUPT PROCESSING. 

RCV ; SERIAL PORT RECEIVE PROCESSING. 
XMT ; SERIAL PORT TRANSMIT PROCESSING. 
PROC ; TRAP VECTOR PROCESSING BEGINS. 
INSTRUCTION AT PROGRAM MEMORY LOCATION 0 DIRECTS 


TO BEGIN HERE FOR RESET PROCESSING THAT INITIAL~ 


e+ ee + HF HF HH FH HF HF HF He HF 


a 
by 


* 


INIT 


ARP 

STO: XXX 
ARB 
STIL: XXX 
REGISTER 

DRR 

DXR 

TIM 

PRD 

IMR 

GREG 


RE 
IMR: 111 
ROVM 
LDPK 
LARP 
LACK 
SACL 


ROCESSOR. WHEN RESET IS APPLIED, THE FOLLOWING 
ARE ESTABLISHED FOR THE STATUS AND OTHER 
EGISTERS: 
OV OVM 1 INTM DP 
0 x 1 i XXXXXAXXX 
CNF TC SXM C 11 HM FSM XF FO TXM PM 
0 x 1 1 11 ue 1 1 0 0 00 
ADDRESS DATA 
>0000 XXXX XXXX XXXX XXXX 
>0001 XXXX XXXX XXXX XXXX 
>0002 11711 1111 1111 1111 
>0003 1111 1111 1111 1113 
>0004 1111 21111 11XX XXxX> 
>0005 1111 1111 0000 OO00C 
SERVED XINT RINT TINT INT2 INTL INTO 
1111111 Xx x xX x Xx Xx 
> DISABI.E OVERFLOW MODE 
0 ; POINT DP REGISTER TO DATA PAGE 0. 
7 ; POINT TO AUXILIARY REGISTER 7. 
>3F ; LOAD ACCUMULATOR WITH >3F. 
4 ; ENABLE ALL INTERRUPTS VIA IMR. 
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+ + 


+ ee eH 


INTERNAL DATA MEMORY INITIALIZATION. 


ZAC 

LARK AR7,>60 
RPTK 31 

SACL *+ 


LRLK AR7,>200 
RPTK 255 
SACL” *+ 


LRLK AR7,>300 
RPTK 255 
SACL *+ 


. 
a’ 
. 
tA 
° 
, 


’ 


~e 


a 
a 


ZERO THE ACCUMULATOR. 
POINT TO BLOCK B2. 


STORE ZERO IN ALL 32 LOCATIONS. 
POINT TO BLOCK BO. 

ZERO ALL OF PAGES 4 ANI 5. 
POINT TO BLOCK Bl. 


ZERO ALL OF PAGES 6 AMD 7. 


—— 


THE PROCESSOR IS INITIALIZED. THE REMAINING APPLICATION- 
DEPENDENT PART OF THE SYSTEM (BOTH ON- AND OFF-CHIP). SHOULD 


NOW BE INITIALIZED. 
EINT 


’ 


ENABLE ALL INTERRUPTS. 
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5.2 Program Control 


To facilitate the TMS320C2x’s use in general-purpose high-speed processing, 
a variety of instructions are provided for software stack expansion, subroutine 
calls, timer operation, single-instruction loops, and external branch control. 
Descriptions and examples of how to use these features of the TMS320C2x 
are given in this section. 


5.2.1 Subroutines 


The TMS320C2x has a 16-bit Program Counter (PC) and a four-level 
(TMS32020) or eight-level (TMS320C25) hardware stack for PC storage. 
The CALL and CALA subroutine calls store the current contents of the pro- 
gram counter on the top of the stack. The RET (return from subroutine) in-: 
struction then pops the top of the stack to the program counter. 


Example 5-3 illustrates the use of a subroutine to determine the square root 
of a 16-bit number. Processing proceeds in the main routine to the point 
where the square root of a number should be taken. At this point a CALL is 
made to the subroutine, transferring control to that section of the program 
memory for execution and then returning te the calling routine via the RET 
instruction when execution has completed. 
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Example 5-3. Subroutines 
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+ ot oe ek OH OH 


AUTOCORRELATION 


CALLS A SQUARE ROOT SUBROUTINE THAT. WILL DETERMINE THE RMS 
AMPLITUDE OF THE WAVEFORM. 


56 
* 
* THIS ROUTINE PERFORMS A CORRELATION OF TWO VECTORS.AND THEN 
* 
* 
* 


AUTOC 
LAC 
CALL 
SACL 


+ 


STO EQU 
ST1 EQU 
NUMBER EQU 
TEMPR EQU 
GUESS EQU 


SQRT SST 
SST 


LDPK . 


SSXM 
SPM 

sACL 
LARP 
LARK 
LALK 
SACL 
SACL 
SACH 
LAC 

SBLK 
BLZ 

LAC 

SACL 
SACL 
LARK 

* 


ENERGY 
SORT 
ENERGY 


SQUARE ROOT 


>60 
>61 
262 
>63 
>64 


NUMBER 
AR1 
AR1,11 
>800 
GUESS 
TEMPR 
ROOT 
NUMBER 
>200 
SORTLP 
GUESS , 3 
GUESS 
TEMPR 
AR1,14 


* SQUARE ROOT LOOP 


* 


SQORTLP SQRA 
ZALH 
SPAC 
BLZ 
ZALH 
SACH 


TEMPR 
NUMBER 


NEXTLP 
TEMPR 
ROOT 


we Ne we we Ne 


= we 


we Ne Ne Ne Ne Ne Ne Ne Ne Ns Ne Ne Ne Ne Ne Ne Ns 


we Ne 


we Ne 


THIS SUBROUTINE DETERMINES THE SQUARE ROOT OF A NUMBER X 
THAT IS LOCATED IN THE LOW HALF OF THE ACCUMULATOR WHEN 
THE ROUTINE IS CALLED. THE FRACTIONAL SQUARE ROOT OF X IS 
TAKEN, WHERE 0 < X < 1 AND WHERE 1 IS REPRESENTED BY 
>7FFF. THE RESULT IS RETURNED TO THE CALLING ROUTINE IN 
THE ACCUMULATOR 


SAVED STATUS REGISTER STO ADDRESS 
SAVED STATUS REGISTER ST1 ADDRESS 
NUMBER X WHOSE SQUARE ROOT IS TAKEN 
INTERMEDIATE ROOTS 

SQUARE ROOT OF X 


SAVE STATUS REGISTER STO. 

SAVE STATUS REGISTER ST1. 

LOAD DATA PAGE POINTER = O. 

SET SIGN-EXTENSION MODE. 

LEFT-SHIFT PR OUTPUT TC ACCUMULATOR. 
SAVE X. 

INITIALIZE VARIABLES FOR SQUARE ROOT. 
12 ITERATIONS 

ASSUME X IS LESS THAN >200. 

SET INITIAL GUESS TO >800. 

SET FIRST INTERMEDIATE ROOT TO >800. 
SET SQUARE ROOT VALUE TC O. 

LOAD X INTO THE ACCUMULATOR. 

TEST IF X IS LESS THAN >200. 

IF YES, TAKE THE ROOT; 

IF NO, THEN REINITIALIZE. 

SET INITIAL GUESS TO >4000. 

SET FIRST INTERMEDIATE ROOT TO >4000. 
15 ITERATIONS 


SQUARE TEMPORARY (INTERMEDIATE) ROOT. 
CHECK IF RESULT IS LESS THAN X. 


IF IT'S NOT, SKIP ROOT UPDATE. 
IF IT IS, SET ROOT EQUAL TEMPR. 
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NEXTLP LAC GUESS,15 
SACH GUESS 
ADDH ROOT 
SACH TEMPR 
BANZ . SORTLP 


SCALE DOWN GUESS BY 2 TO CONVERGE. 


we 


ADD CURRENT ROOT ESTIMATE. 
UPDATE TEMPORARY ROOT VALUE. 
REPEAT SPECIFIED NO. OF ITERATIONS. 


ue Se NO Ne Ne Me 


LAC ROOT LOAD THE ROOT OF X. 

LST1 ST1 RESTORE STATUS REGISTER STi. 
LST STO RESTORE STATUS REGISTER STO. 
RET 


Hardware stack allocation involves its use in interrupts, subroutine calls, 
pipelined instructions, and the emulator (XDS). The TMS320C2x disables all 
interrupts when taking an interrupt trap. If interrupts are enabled more than. 
one.instruction before the return of the interrupt service routine, the routine 
can also be interrupted, thus using. another level of the hardware stack. This 
condition should be considered when managing the use of the stack. When 
nesting subroutine calls, each call uses a level of the stack. The number. of 
levels used by the interrupt must be remembered as well as the depth of the 
nesting of subroutines. One level of the stack is reserved for the emulator 
(XDS) to be used for breakpoint/single-step operations. lf the XDS is. not 
used, this extra level is available for internal use. Given these constraints, the 
following listings describe possible allocations of the hardware stack levels: 


TMS32020: 


- 1 level reserved for emulator (XDS) stack 

- 1 level reserved for TRAP (software interrupt) instruction 
- 1 level reserved for interrupt service routines (ISR) 

- 1 level available for subroutine calls. 


TMS320C25: 


- 1 level reserved for emulator (XDS) stack 

- 1. level reserved for TRAP (software interrupt) instruction 
- 1 level reserved for interrupt service routines (ISR) 

- 5 levels available for subroutine calls. 


or: 


- 1.level reserved for emulator (XDS) stack 

- 1 level reserved for TRAP (software interrupt) instruction 
- 2 levels reserved for interrupt service routines (ISR) 

- 4 levels available for subroutine calls. 


When two levels are allocated for ISRs on the TMS320C25, the individual 
ISRs can utilize one level of subroutine calls or one level of interrupt nesting. 
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5.2.2 Software Stack 


Provisions have been made on the TMS320C2x for extending the hardware 
stack into data memory. This is useful for deep subroutine nesting or stack 
overflow protection. 


The hardware stack is accessible via the accumulator using the PUSH and 
POP instructions. Two additional instructions, PSHD and POPD, are included 
in the instruction set so that the stack may be directly stored to and recovered 
from data memory. 


A software stack can be implemented by using the POPD instruction at the 
beginning of each subroutine in order to save the PC in data memory. Then 
before returning, a PSHD is used to put the proper value back onto the top 
of the stack. 


When the stack has three (TMS32020) or seven (TMS320C25) values stored 
on it and two or more values are to be put on the stack before any other values 
are popped off, a subroutine that expands the stack is needed, such as shown 
in Example 5-4. In this example, the main program stores the stack starting 
location in memory in AR2 and indicates to the subroutine whether to push 
data from memory onto the stack or pop data from the stack to memory. Ifa 
zero is loaded into the accumulator before calling the subroutine, the subrou- 
tine pushes data from memory to the stack. If a one is loaded into the accu- 
mulator, the subroutine pops data from the stack to memory. 


Since the CALL instruction uses the stack to save the program counter, the 
subroutine pops this value into the accumulator and utilizes the BACC 
(branch to address specified by accumulator) instruction to return to the main 
program. This prevents the program counter from being stored into a memory 
location. The subroutine in Example 5-4 uses the BANZ (branch on auxiliary 
register not zero) instruction to control! all of its loops. 


Example 5-4. Software Stack Expansion 


* THIS ROUTINE EXPANDS THE STACK WHILE LETTING THE MAIN 

* PROGRAM DETERMINE WHERE TO STORE THE STACK CONTENTS OR FROM 
* WHERE TO RECOVER THEM. 

* 


STACK LARP 2 USE AR2. 


r 
BNZ PO ; IF POPD IS NEEDED, GOTO PO. 
POP 7 ELSE, SAVE PROGRAM COUNTER. 
RPTK 6 ; LOAD REPEAT COUNTER. 
PSHD *+ ; PUT MEMORY IN STACK. 
BACC ; RETURN TO MAIN PROGRAM. 
PO POP ; SAVE PROGRAM COUNTER. 
MAR *- ; ALIGN STACK POINTER. 
RPTK 6 ; LOAD REPEAT COUNTER. 
POPD *- ; PUT STACK IN MEMORY. 
MAR *¥+ ; REALIGN STACK POINTER. 
BACC ; RETURN TO MAIN PROGRAM. 
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5.2.3 Timer Operation 


The TMS320C2x provides a 16-bit on-chip timer and its associated interrupt 
to perform various functions at regular time intervals. The timer is a down 
counter that is continuously clocked by CLKOUT1 on the TMS320C25, and 
counts (PRD + 1) cycles of CLKOUT1. The timer is clocked by CLKOUT1/4 
on the TMS32020, and counts (4 x PRD) cycles of CLKOUT1. By pro- 
gramming the period (PRD) register from 1 to 65,535 (>FFFF), a timer inter- 
rupt (TINT) can be generated every 2 to 65,536 cycles on the TMS320C25. 
Note that a TINT can be generated every 4 to 262,140 cycles on the 
TMS32020. (A period register value of zero is not allowed.) 


Two memory-mapped registers are used to operate the timer. The timer (TIM) 
register, data memory location 2, holds the current count of the timer. At every 
CLKOUT1 cycle, the TIM register is decremented by one. The PRD register, 
data memory location 3, holds the starting count for the timer. When the TIM 
register decrements to zero, a timer interrupt (TINT) is generated. In the fol- 
lowing cycle, the contents of the PRD register are loaded into the TIM register. 
In this way, a TINT is generated every (PRD + 1) cycles of CLKOUT1 on the 
TMS320C25 or (4 x PRD) cycles of CLKOUT1 on the TMS32020. 


The timer and period registers can be read from or written to on any cycle. The 
count can be monitored by reading the TIM register. A new counter period 
can be written to the PRD register without disturbing the current timer count. 
The timer will then start the new period after the current count is complete. If 
both the PRD and TIM registers are loaded with a new period; the timer begins 
decrementing the new period without generating an interrupt. Thus, the pro- 
grammer has complete control of the current and next periods of the timer. 


The TIM register is set to the maximum value on reset (>FFFF) for both the 
TMS32020 and TMS320C25. The PRD register is also initialized by reset on 
the TMS320C25 to >FFFF. The TMS32020 requires a software initialization 
of the PRD register (see Example 5-1). The TIM register begins decrementing 
only after RS is de-asserted. If the timer is not used, TINT should be masked. 
The PRD register can then be used as a general-purpose data memory !o- 
cation. If TINT is used, the PRD and TIM registers should be programmed 
before unmasking the TINT. 
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Example 5-5 and Example 5-6 show the assembly code that implements the 
use of the timer to divide down the CLKOUT1 signal. To generate a 9600-Hz 
clock signal, the PRD register should be loaded with 520. In the timer inter- 
rupt service routine, the XF line is toggled. The XF output is also used as an 
input for BIO in this example. The output of XF will provide a 50-percent duty 
cycle clock signal as long as the main routine or other interrupt routines do 
not disable interrupts. Interrupts may be disabled by direct or implied use of 
DINT, or by executing instructions in the repeat mode. The value for the PRD 


register is calculated as follows: 


TMS32020: 


CLKOUT1/(4 x PRD) = 2 x frequency of signal 


5 MHz/(4 x 65) = 2 x 9600 Hz (= 9615 Hz for divided signal) 


TMS320C25: 


CLKOUT1/(PRD + 1) = 2 x frequency of signal 


10 MHz/(520 + 1) = 2 x 9600 Hz (= 9597 Hz for divided signal) 


Example 5-5. Clock Divider Using Timer (TMS32020) 


* SETUP FOR 
* 


INTERRUPT SERVICE ROUTINE. 


65 

DMA3 ; LOAD THE PERIOD REGISTER. 

8 

DMA4 

DMA4 ; ENABLE THE TIMER INTERRUPT. 
; ENABLE INTERRUPTS. 


* I/O SERVICE ROUTINE. 


* 


TIME BIOZ 
RXF 
EINT 
RET 

SET1 SXF 
EINT 
RET 


SET1 P 

> XF WAS 
; ENABLE 
; RETURN 
; XF WAS 
; ENABLE 
; RETURN 


CHECK THE CURRENT XF STATE. 


HIGH; SET IT LOW. 
INTERRUPTS. 
TO INTERRUPTED CODE. 
LOW; SET IT HIGH. 
INTERRUPTS. 
TO INTERRUPTED CODE. 
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Example 5-6. Clock Divider Using Timer (TMS320C25) 


* SETUP FOR INTERRUPT SERVICE ROUTINE. 
* 


LALK 520 

SACL DMA3 ; LOAD THE PERIOD REGISTER. 
LACK 8 

OR DMA4 

SACL DMA4 ; ENABLE THE TIMER INTERRUPT. 


EINT ; ENABLE INTERRUPTS. 


* I/O SERVICE ROUTINE. 
* 


TIME BIOZ SET1 CHECK THE CURRENT XF STATE. 


7 
RXF ; XF WAS HIGH; SET IT LOW. 
EINT ; ENABLE INTERRUPTS. 
RET ; RETURN TO INTERRUPTED CODE. 
SET1L SXF ; XF WAS LOW; SET IT HIGH. 
EINT ; ENABLE INTERRUPTS. 
RET ; RETURN TO INTERRUPTED CODE. 


5.2.4 Single-Instruction Loops 


When programming time-critical high-computational tasks, it is often neces- 
sary to repeat the same operation many times. For these cases, repeat in- 
structions that allow the execution of the next single instruction N+1 times 
are provided. N is defined by an eight-bit repeat counter (RPTC), which is 
loaded by the RPT or RPTK instructions. The instruction immediately follow- 
ing is then executed, and the RPTC is decremented until it reaches zero. 


When using the repeat feature, the instruction being repeated is fetched only 
once. As a result, many multicycle instructions become single-cycle when re- 
peated. This is especially useful for |/O instructions, such as TBLR/TBLW, 
IN/OUT, or BLKD/BLKP. 


Since the instruction is fetched and internally latched, the program bus can 
be used to fetch or write a second operand in parallel to operations using the 
data bus. With the instruction latched for repeated execution, the program 
counter can be loaded with a data address and incremented on succeeding 
executions to fetch data in successive memory locations. As an example, the 
MAC instruction fetches the multiplicand from program memory via the pro- 
gram bus. Simultaneous with the program bus fetch, the second multiplicand 
is fetched from data memory via the data bus. In addition to these data fetches, 
preparation is made for accesses in the following cycles by incrementing the 
program counter and by indexing the auxiliary register. TBLR is another ex- 
ample of an instruction that benefits from simultaneous transfers of data on 
both the program and data buses. In this case, data values from a table in 
program memory may be read and transferred to data memory. When re- 
peated, the program overhead of reading the instruction from program memory 
must be executed only once, thus allowing the rest of the executions to op- 
erate in a single cycle. 
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Programs, such as those implementing digital filters, require loops that execute 
in a minimum amount of time. Example 5-7 shows the use of the RPT or 
RPTK instructions. 


Example 5-7. Instruction Repeating 


+ t+ FH HF HF 4 FH HH HF HF HF EF 


THIS ROUTINE USES THE RPT INSTRUCTION TO SET UP THE LOOP 
COUNTER IN ONE CYCLE. THE FOLLOWING EQUATION IS IMPLEMENTED 
IN THIS ROUTINE: 


\ X(I) x ¥(Z) 
/ 


THIS ROUTINE ASSUMES THAT THE X VALUES ARE LOCATED IN 
ON-CHIP RAM BLOCK BO, AND THE Y VALUES IN BLOCK Bl. WHEN 
REPLACING RPT NUM WITH RPTK 9, THE PROGRAM WILL EXECUTE 
THE SAME WAY. 


SERIES LARP AR4 


CNFP ; CONFIG BLOCK BO AS PROGRAM MEMORY. 
LACK 9 ; SET COUNTER TO 9. 

SACL NUM ; (NUM) = 9. 

LRLK AR4,>300; POINT AT BEGINNING OF DATA. 

MPYK >0O ; CLEAR P REGISTER. 

ZAC ; CLEAR ACCUMULATOR. 

RPT NUM ; EXECUTE NEXT INSTRUCTION 10 TIMES. 
MAC »>FFOO,*+ ; MULTIPLY-ACCUMULATE; INCREMENT AR4. 
APAC 


RET ; RETURN TO MAIN PROGRAM. 


5.2.5 Computed GOTOs 
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Processing may be executed in a time- and process-dependent or selected 
way. Following a specific time or data processing path may then result in se- 
lecting one of several processing options. 


A simple computed GOTO can be programmed in the TMS320C2x by using 
the CALA instruction. This instruction uses the contents of the accumulator 
as the direct address of the call. Thus, the call address can be computed in the 
ALU, as shown in Example 5-8. 
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Example 5-8. Computed GOTO 


* H+ + HF HH HH He Ft HF 


= 
Ha) 
H 
| 


IDLE 
LAC 
SUB 
BGEZ 
LACK 
OVRSAM SACL 
ADLK 
TBLR 
LAC 
CALA 
B 
* 
TSKSEQ EQU 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 
DATA 


SAMPLE CYCLE, 


TASK CONTROLLER 


SAMPLE 
ONE 
OVRSAM 
15 
SAMPLE 
TSKSEQ 
TEMP 
TEMP 


WAIT 


$ 
DUMMY 
DUMMY 
DUMMY 
DUMMY 
BDCLK2 
DUMMY 
OUT 
DECODE 
DEMODB 
DUMMY 
AGCUPT 
DUMMY 
BDCLK1 
DUMMY 
DUMMY 
DUMMY 


we Ne Ne Ne Ne Ne Ne Ne Ne NO 


“Ne Ne Ne Ne Ne Ne te Ne Se Ne Ne Ne Ne Ne Ne Ne 


WAIT 
FETC 
DECR 
TEST 
INIT 
SAVE 
ADD 

READ 
LOAD 
EXEC 


12 - 
1i-= 
10 - 


OrRNWAUDN~ TOW 
i 


THIS MAIN TASK ROUTINE CONTROLS THE ORDER OF EXECUTION 
AND SCHEDULING OF TASKS. WHEN AN INTERRUPT OCCURS, THE 
INTERRUPT SERVICE ROUTINE IS EXECUTED TO PROCESS THE INPUT 
AND OUTPUT DATA SAMPLES. AFTER THE INTERRUPT SERVICE 
ROUTINE HAS COMPLETED, THE PROCESSOR BEGINS EXECUTION WITH 
THE INSTRUCTION FOLLOWING THE IDLE INSTRUCTION. THIS 
ROUTINE SELECTS THE TASK APPROPRIATE FOR THE CURRENT 
CALLS THE TASK AS A SUBROUTINE, AND BRANCHES 
BACK TO THE IDLE TO WAIT FOR THE NEXT SAMPLE INTERRUPT 
WHEN THE SCHEDULED TASK HAS COMPLETED EXECUTION. 


FOR SAMPLE INTERRUPT. 

H SAMPLE COUNT VALUE. 

EMENT THE SAMPLE COUNT. 

FOR END OF BAUD INTERVAL. 
COUNT FOR NEW BAUD INTERVAL. 
NEW COUNT VALUE. 

TASK TABLE BASE ADDRESS. 
SUBROUTINE TASK ADDRESS. 
ACCUMULATOR FOR TASK CALL. 

UTE APPROPRIATE TASK. 


UNUSED CYCLE 

UNUSED CYCLE 

UNUSED CYCLE 

UNUSED CYCLE 

COMPUTE ENERGY E(11) 

UNUSED CYCLE 

COMMUNICATE WITH U-CONTROLLER 
DECODE/GET SCRAMBLED DIBIT 
DEMODULATE IN MIDDLE OF BAUD 
UNUSED CYCLE 
UPDATE AGC EVERY 3RD BAUD 
UNUSED CYCLE 

COMPUTE ENERGY E(3) 

UNUSED CYCLE 

UNUSED CYCLE 

UNUSED CYCLE 
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5.3 Interrupt Service Routine 


Interrupts on the TMS320C2x are prioritized and vectored. When an interrupt 
occurs, the corresponding flag is set in the Interrupt Flag Register (IFR). lf 
the corresponding bit in the Interrupt Mask Register (IMR) is set and inter- 
rupts are enabled (INTM=0), then interrupt processing begins. 


When the interrupt vector is loaded into the program counter, interrupts are 
disabled (INTM=1) and a branch is made to the appropriate routine via the 
branch instruction stored at the associated vector location. Since all interrupts 
are disabled, interrupt processing may proceed without further interruption 
unless the interrupt service routine (ISR) re-enables interrupts. 


Unless the interrupt service routines are simple |/O handlers, the processing 
in each ISR generally must assure that the processor context is preserved 
during execution. The context must be saved before executing the routine it- 
self and restored when the routine is finished. A common routine or routines 
individualized for each interrupt may be used to secure the context of the 
processor during interrupt processing. Context switching is also useful for 
subroutine calls, especially when extensive use is made of the stack or auxil- 
iary registers. Code examples of context switching and an interrupt service 
routine are provided in this section. 


5.3.1 Context Switching 
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Context switching, commonly required when processing a subroutine call or 
interrupt, may be quite extensive or simple, depending on the system require- 
ments. On the TMS320C2x, the program counter is stored automatically on 
the hardware stack. If there is any important information in the other 
TMS320C2x registers, such as the status or auxiliary registers, these must be 
saved by software command. A stack in data memory, identified by an auxil- 
iary register, is useful for storing the machine state when processing interrupts. 


Examples of saving and restoring the state of the TMS32020 are given in Ex- 
ample 5-9 and Example 5-10. Auxiliary register 4 (AR4) is used in both ex- 
amples as the stack pointer. As the stack grows, it expands into lower memory 
addresses. The registers saved are the status registers (STO and ST1), accu- 
mulator (ACCH and ACCL), product register (PR), temporary register (TR), 
all four levels of the hardware stack, and the auxiliary registers (ARO through 
AR4). 
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Example 5-9. Context Save (TMS32020) 


+ ee He 


SAVE 


TITL 
DEF 


‘CONTEXT SAVE' 


SAVE 


CONTEXT SAVE ON SUBROUTINE CALL OR INTERRUPT. 


ASSUME AR4 IS THE STACK POINTER AND AR4 


LARP 
MAR 


SAVE THE 
SST1 
ssT 


SAVE THE 
SACH 
SACL 


SAVE THE 
SPM 
PAC 
SACH 
SACL 


SAVE THE 
MPYK 
PAC 
SACL 


SAVE ALL 
RPTK 
POPD 


4 


ke 


. 
‘ 


. 
a, 


(ARP) 


STATUS REGISTERS. 
; ST1 -> (127), 
STO -> (126), 


ke 
*— 


’ 


ACCUMULATOR. 
; ACCH -> (125), 
; ACCL -> (124), 


a 
ko 


P REGISTER. 


NO SHIFT ON PR OUTPUT 


-> ARB, 


0 ; 

ke ; PRH ~> (123), 
ke ; PRL -> (122), 
T REGISTER. 

>1 

-= ; TR 


FOUR LEVELS OF THE 


3 


Km 


° 
’ 
. 
, 
’ 
’ 


=? (121), 


128. 


4 -> ARP,AR4 


AR4 


AR4 
AR4 


AR4 
AR4 


AR4 
AR4 


AR4 


HARDWARE STACK. 


SAVE AUXILIARY REGISTERS ARO THROUGH AR3. 


SAR 
SAR 
SAR 
SAR 


SAVE IS COMPLETE. 


ARO,*- 
AR1,*- 
AR2,*- 
AR3,*- 


a 
. 
f 
s 
. 
td 


TOS (4) -> (120), 

STACK(3) -> (119), 

STACK(2) -> (118), 

BOS (1) -> (117), 
; ARO -> (116), 

AR1L -> (115), 

AR2 -> (114), 

AR3 -> (113), 


AR4 
AR4 
AR4 
AR4 


AR4 
AR4 
AR4 
AR4 


tt ot 


ho oul 


128 
127 


126 
125 


124 
123 


122 


121 


120 


119 
118 
117 
116 


115 
114 
113 
112 
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Example 5-10. Context Restore (TMS32020) 


TITL 'CONTEXT RESTORE’ 
DEF RESTOR 


* 
* CONTEXT RESTORE AT THE END OF A SUBROUTINE OR INTERRUPT. 
* 
* ASSUME AR4 IS THE STACK POINTER AND AR4 = 112. 
* 
RESTORLARP 4 ; (ARP) -> ARB, 4 -> ARP,AR4 = 112 
MAR *+ ; AR4 = 113 
* 
* RESTORE AUXILIARY REGISTERS ARO THROUGH AR3. 
LAR AR3,*+ ; (113) -> AR3, AR4 = 114 
LAR AR2,*+ ; (114) -> AR2, AR4 = 115 
LAR AR1,*+ ; (115) -> ARI, AR4 = 116 
LAR ARO,*+ ; (116) -> ARO, AR4 = 117 
* 
* RESTORE ALL FOUR LEVELS OF THE HARDWARE STACK. 
RPTK 3 
PSHD *+ ; (117) -> BOS (1), AR4 = 118 
; (118) -> STACK(2), AR4 = 119 
; (119) -> STACK(3), AR4 = 120 
; (120) -> Tos (4), AR4 = 121 


* 
* THE RETURN PC IS NOW ON THE HARDWARE STACK FOR THE 
'* RET INSTRUCTION. NOTE THAT THE LOWER 16 BITS OF THE 
* P REGISTER MUST BE LOADED VIA THE T REGISTER AND THAT 
* THE STACK POINTER IS POINTING AT THE VALUE TO BE LOADED 
* IN THE T REGISTER. 
* 
* RESTORE THE LOW P REGISTER. 
MAR ¥*+ ; AR4 = 122 
LT *= p CL22) => TR, AR4 = 121 
MPYK >1 ; (TR) => PRL, AR4 = 121 
x 
* RESTORE THE T REGISTER. 
LT ¥+ ; (121) -> TR, AR4 = 122 
MAR *+ : AR4 = 123 
x 
* RESTORE THE HIGH P REGISTER. 
| LPH *+ ; (123) -> PRH, AR4 = 124 
* 
* RESTORE THE ACCUMULATOR. 
ZALS *+ ; (124) => aAccL, AR4 = 125 
ADDH *+ ; (125) -> ACCH, AR4 = 126 
* 
* RESTORE THE STATUS REGISTERS. 
LST *+ ; (126) -> STO, AR4 = 127 
LST1 * ; (127) -> STI, AR4 = 128 
* 
* RESTORE IS COMPLETE. 
EINT ; ENABLE INTERRUPTS. 
RET ; RETURN TO INTERRUPTS OR CALLING ROUTINE. 
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Examples of saving and restoring the state of the TMS320C25 are given ir 
Example 5-11 and Example 5-12. Auxiliary register 7 (AR7) is used in both 
examples as the stack pointer. As the stack grows, it expands into lower 
memory addresses. The registers saved are the status registers (STO and ST1), 
accumulator (ACCH and ACCL), product register (PR), temporary register 
(TR), all eight levels of the hardware stack, and the auxiliary registers (ARO 
through AR6). 


The routines in Example 5-11 and Example 5-12 are protected against inter- 
rupts, allowing context switches to be nested. This is accomplished by the 
use of the MAR *- and MAR “*+ instructions at the beginning of the context 
save and context restore routines, respectively. Note that the last instruction 
of the context save decrements AR7 while the context restore is completed 
with an additional increment of AR7. This prevents the loss of data if a context 
save or restore routine is interrupted. 
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Example 5-1 1. Context Save (TMS320C25) 
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TITL 'CONTEXT SAVE' 
DEF SAVE 
* 
* CONTEXT SAVE ON SUBROUTINE CALL OR INTERRUPT. 
* 
* ASSUME AR7 IS THE STACK POINTER AND AR7 = 128. 
* 
SAVE LARP AR7 ; (ARP) -> ARB, 7 -> ARP,AR7 
MAR *- ; AR7 
* . 
* SAVE THE STATUS REGISTERS. 
SST1 *- ; ST1 -> (127), AR7 
SST *- ; STO -> (126), AR7 
* 
* SAVE THE ACCUMULATOR. 
SACH *- ; ACCH -> (125), AR7 
SACL *- ; ACCL -> (124), AR7 
* 
* SAVE THE P REGISTER. 
SPM 0O ; NO SHIFT ON PR OUTPUT 
SPH #*- ; PRH -> (123), AR7 
SPL *- ; PRL -> (122), AR7 
* 
* SAVE THE T REGISTER. 
MPYK 1 ; PR = TR 
SPL *- ; TR -> (121), AR7 
* 
* SAVE ALL EIGHT LEVELS OF THE HARDWARE STACK. 
RPTK 7 
POPD *- ; TOS (8) -> (120), AR7 
= ; STACK(7) -> (119), AR7 
id ; STACK(6) -> (118), AR7 
* ; STACK(5) <-> (117), AR7 
* ; STACK(4) -> (116), AR7 
* ; STACK(3) -> (115), AR7 
* ; STACK(2) -> (114), AR7 
* ; BOS (1) -> (113), AR7 
‘ 
* SAVE AUXILIARY REGISTERS ARO THROUGH AR6. 
SAR ARO,*- ; ARO -> (112), AR7 
SAR AR1,*- ; AR1 -> (111), AR7 
SAR AR2,*- ; AR2 -> (110), AR7 
SAR AR3,*- ; AR3 -> (109), AR7 
SAR AR4,*- ; AR4 -> (108), AR7 
SAR ARS5,*- ; ARS -> (107), AR7 
SAR ARG6,*- ; AR6 -> (106), AR7 


SAVE IS COMPLETE. 


toll 


Ho Wow th tt th al 


io to nw ow neat 


128 
127 


126 
125 


124 
123 


122 
121 


120 
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Example 5-12. Context Restore (TMS320C25) 


e t+ He HF 


* 
* 


* 


TITL 


DEF 


"CONTEXT RESTORE' 
RESTOR 


CONTEXT RESTORE AT THE END OF A SUBROUTINE OR INTERRUPT. 


ASSUME AR7 IS THE STACK POINTER AND AR7 = 105. 


RESTORLARP ART ; (ARP) -> ARB, 7 -> ARP,AR7 = 105 
MAR *+ ; AR7 = 106 
RESTORE AUXILIARY REGISTERS ARO THROUGH ARG. 

LAR ARG,*+ ; (106) -> ARG, AR7 = 107 

LAR ARS,*+ ; (107) -> ARS, AR7 = 108 

LAR AR4,*+ ; (108) -> AR4, AR7 = 109 

LAR AR3.*+ ; (109) => AR3, AR7 = 110 

LAR AR2,*+ ; (110) -> AR2, AR7 = 111 

LAR AR1,*+ ; (111) -> ARI, AR7 = 112 

LAR ARO,*+ ; (112) -> ARO, AR7 = 113 

RESTORE ALL EIGHT LEVELS OF THE HARDWARE STACK. 

RPTK 7 

PSHD *+ ; (113) => BOS (1), AR7 = 114 

; (114) -> STACK(2), ART = 115 

; (115) -> STACK(3), AR7 = 116 

; (116) -> STACK(4), AR7 = 117 

; (117) -> STACK(5), AR7 = 118 

; (118) -> STACK(6), AR7 = 119 

; (119) -> STACK(7), AR7 = 120 

; (120) -> TOS (8), AR7 = 121 


+e + ee HF HF HH HF HF HF HF F 


THE RETURN PC IS NOW ON TOP OF THE STACK FOR THE RET 
INSTRUCTION. THE LOWER 16 BITS OF THE P REGISTER MUST 
BE LOADED VIA THE T REGISTER AND THE STACK POINTER BE 
POINTING AT THE VALUE TO BE LOADED IN THE T REGISTER. 


RESTORE THE LOW P REGISTER. 


MAR . *+ ; SKIP T REGISTER, AR7 = 122 
LT *- ; (122) -> TR, AR7 = 121 
MPYK 1 ; (TR) -> PRL 
RESTORE THE T REGISTER. 
LT = *+ ; (121) -> TR, ART = 122 
MAR *+ ; SKIP P REGISTER LOW, AR7 = 123 
RESTORE THE HIGH P REGISTER. 
LPH *+ ; (123) -> PRH, AR7 = 124 
RESTORE THE ACCUMULATOR. 
ZALS *+ ; (124) => ACCL, AR7 = 125 
ADDH *+ ; (125) -> ACCH, AR7 = 126 
RESTORE THE STATUS REGISTERS. 
LST *+ ; (126) -> STO, AR7 = 127 
LST1L *+ ; (127) -> sT1, AR7 = 128 


RESTORE IS COMPLETE. 
EINT 


RET 


. 
a 


e 
af 


ENABLE INTERRUPTS. 
RETURN TO INTERRUPTS OR CALLING ROUTINE. 
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5.3.2 Interrupt Priority 


Interrupts on the TMS320C2x are prioritized in hardware. This allows inter- 
rupts that occur simultaneously to be serviced in a prioritized order. Some- 
times priority may be determined by frequency or rate of occurrence. An 
infrequent, but lengthy, interrupt service routine (ISR) might need to be in- 
terrupted by a more frequently occurring interrupt. In the routine of Example 
5-13, the ISR for INT1 temporarily modifies the interrupt mask register (IMR) 
to permit interrupt processing when an interrupt on INTO (but no other inter- 
rupt) occurs. When the routine has finished processing, the IMR is restored 
to its original state. Example 5-13 is written for the TMS320C25; however, 
AR4 can be substituted for AR7 when using the TMS32020. 


Example 5-13. Interrupt Service Routine 
TITL ‘INTERRUPT SERVICE ROUTINE! 


DEF ISR1 
REF IMR 


* 
* INTERRUPT PROCESSING FOR EXTERNAL INTERRUPT. INT1-. 
* 
* THIS ROUTINE MAY BE INTERRUPTED BY AN INTERRUPT FROM THE 
* EXTERNAL INTERRUPT INTO-, BUT NO OTHER. 
* 
ISR1 LARP AR7 ; 7 --> ARP 
MAR *- ; AR7 = AR7 - 1 
SST1 *- ; STL --> *AR7, AR7 = ART - 1 
SST *- ; STO --> *AR7, AR7 = AR7 - 1 
SACH *- ; ACCH --> *AR7, AR7 = AR7 - 1 
SACL *- ; ACCL --> *AR7, AR7 = AR7 - 1 
LDPK 0 ; DP = 0 
PSHD IMR ; IMR --> TOS 
LACK >0001 ; MASK FOR INTO- 
AND IMR ; MASK CURRENT IMR CONTENTS. 
SACL IMR ; ACC --> IMR 
EINT ; ENABLE INTERRUPTS. 
* 
* MAIN PROCESSING SECTION FOR ISR1. 
. f 
DINT ; DISABLE INTERRUPTS. 
LDPK 0 ; DP = 0 
POPD IMR ; TOS --> IMR 
LARP AR7 ; 7 --> ARP 
MAR *+ ; AR7 = AR7 + 1 
ZALS *+ ; *AR7 --> ACCL, AR7 = AR7 + 1 
ADDH *+ ; *AR7 --> ACCH, AR7 = AR7 + 1 
LST *+ ; *AR7 --> STO, AR7 = AR7 + 1 
LST1 *+ ; *AR7 --> ST1, AR7 = ART + 1 
EINT ; ENABLE INTERRUPTS. 
RET 
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5.4 Memory Management 


The structure of the TMS320C2x memory map is programmable and can vary 
for each application. Instructions are provided for moving blocks of data or 
program memory, configuring a block of on-chip data RAM as program 
memory, and defining part of external data memory as global. Explanations 
and examples of moving, configuring, and manipulating memory are provided 
in this section. 


5.4.1 Block Moves 


Since the TMS320C2x directly addresses a large amount of memory, blocks 
of data or program code can be stored off-chip in slow memories and then 
loaded on-chip for faster execution. Data can also be moved from on-chip to 
off-chip for storage or for multiprocessor data transfers. 


The BLKD and BLKP instructions facilitate memory-to-memory block moves. 
on the TMS320C2x. The BLKD instruction moves a block within data memory 
as shown in Example 5-14. Data may also be transferred between data mem- 
ory and program memory by means of the TBLR and TBLW instructions. The 
instructions IN and OUT are used to transfer data between the data memory 
and the I/O space. 


Example 5-14. Moving External Data to Internal Data Memory with BLKD 


* THIS ROUTINE USES THE BLKD INSTRUCTION TO MOVE A BLOCK OF 
* EXTERNAL DATA MEMORY (DATA PAGES 8 AND 9) TO INTERNAL BLOCK 
* Bl (DATA PAGES 6 AND 7). 

* 


MOVED . LARP AR2 


LRLK AR2,>300 ; DESTINATION IS BLOCK Bl IN RAM. 


; 
RPTK 255 ; REPEAT NEXT INSTRUCTION 256 TIMES. 
BLKD >400,*+ ; MOVE EXTERNAL BLOCK TO BLOCK Bl. 
RET ; RETURN TO MAIN PROGRAM. 


For systems that have external program memory but no external data memory, 
BLKP can be used to move program memory blocks into data memory. Ex- 
ample 5-15 demonstrates how to use the BLKP instruction. 


Example 5-15. Moving Program Memory to Data Memory with BLKP 


THIS ROUTINE USES THE BLKP INSTRUCTION TO MOVE DATA VALUES 
FROM PROGRAM MEMORY INTO DATA MEMORY. SPECIFICALLY, THE 
VALUES IN LOCATIONS 2, 3, 4, AND 5 IN PROGRAM MEMORY ARE 
MOVED TO LOCATIONS 512, 513, 514, AND 515 IN DATA MEMORY. 


+ + + + 


MOVEP LARP AR2 ; SET REFERENCE FOR INDIRECT ADDRESSING. 
LRLK AR2,512 ; LOAD BEGINNING OF BLOCK BO IN AR2. 


RPTK 3 ; SET UP LOOP. 
BLKP >2,*+ ; PUT DATA INTO DATA RAM. 
RET ; RETURN TO MAIN PROGRAM. 


Another method for transferring data from program memory into data memory 
makes use of the TBLR instruction. By using the TBLR instruction, a calcu- 
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lated, rather than predetermined, location of a block of data in program mem- 
ory may be specified for transfer. A routine using this approach is shown in 
Example 5-16. 


Example 5-16. Moving Program Memory to Data Memory with TBLR 


THIS ROUTINE USES THE TBLR INSTRUCTION TO MOVE DATA VALUES 
FROM PROGRAM, MEMORY INTO DATA MEMORY. BY USING THIS ROUTINE, 
THE PROGRAM MEMORY LOCATION IN THE ACCUMULATOR FROM WHICH 
DATA IS TO BE MOVED TO A SPECIFIC DATA MEMORY LOCATION CAN 
BE SPECIFIED. ASSUME THAT THE ACCUMULATOR CONTAINS THE 
ADDRESS IN PROGRAM MEMORY FROM WHICH TO TRANSFER THE DATA. 


+ + ee He H 


TABLER LARP AR3 


LRLK AR3,380; DESTINATION ADDRESS = PAGE 7. 


; 
RPTK 127 ; TRANSFER 128 VALUES. 
TBLR *+ ; MOVE DATA INTO DATA RAM. 
RET 7 RETURN TO CALLING PROGRAM. 


In cases where systems require that temporary storage be allocated in the 
program memory, TBLW can be used to transfer data from internal data 
memory to external program memory. The code in Example 5-17 demonstrates 
how this may be accomplished. 


Example 5-17. Moving Internal Data Memory to Program Memory with TBLW 


THIS ROUTINE USES THE TBLW INSTRUCTION TO MOVE DATA VALUES 
FROM INTERNAL DATA MEMORY TO EXTERNAL PROGRAM MEMORY. THE 
CALLING ROUTINE MUST SPECIFY THE DESTINATION PROGRAM MEMORY 
ADDRESS IN THE ACCUMULATOR. ASSUME THAT THE ACCUMULATOR 
CONTAINS THE ADDRESS IN PROGRAM MEMORY INTO WHICH THE DATA 
IS TRANSFERRED. 


+e Se eH HF HE 


TABLEW LARP AR4 
LRLK AR4,380; SOURCE ADDRESS = PAGE 7. 
RPTK 127 7 TRANSFER 128 VALUES. 
TBLW *+ ; MOVE DATA TO EXTERNAL PROGRAM RAM. 
RET 3 RETURN TO CALLING PROGRAM. 


The IN and OUT instructions are used to transfer data between the data 
memory and the !1/O space, as shown in Example 5-18 and Example 5-19. 


Example 5-18. Moving Data from 1/O Space into Data Memory with IN 
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* THIS ROUTINE USES THE IN INSTRUCTION TO MOVE DATA VALUES. 
* FROM THE I/O SPACE INTO DATA MEMORY. DATA ACCESSED FROM 

* I/O PORT 15 IS TRANSFERRED TO SUCCESSIVE MEMORY LOCATIONS 
* ON DATA PAGE 5. 

* 


INPUT LARP AR2 


LRLK AR2,>2CO; DESTINATION ADDRESS = PAGE 5. 


7 
RPTK 63 ; TRANSFER 64 VALUES. 
IN PA15,*+ ; MOVE DATA INTO DATA RAM. 
RET ; RETURN TO CALLING PROGRAM. 
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Example 5-19. Moving Data from Data Memory to I/O Space with OUT 


THIS ROUTINE USES THE OUT INSTRUCTION TO MOVE DATA VALUES 
FROM THE DATA MEMORY TO THE I/O SPACE. DATA IS TRANSFERRED 
TO I/O PORT 8 FROM SUCCESSIVE MEMORY LOCATIONS ON DATA 
PAGE 4. 


e+ + bt He 


OUTPUT LARP AR4 


LRLK AR4,>200 ; SOURCE ADDRESS = PAGE 4. 


; 
RPTK 63 ; TRANSFER 64 VALUES. 
OUT PA8,*+ ; MOVE DATA FROM DATA RAM. 
RET ; RETURN TO CALLING PROGRAM. 


5.4.2 Configuring On-Chip RAM 


The large amount of external memory and the configurability of on-chip RAM 
simplify the downloading of data or program memory into the TMS320C2x. 
Also, since data in the RAM is preserved when redefining on-chip RAM, block 
BO can be configured dynamically as either data or program memory. Figure 
5-1 illustrates the changes in on-chip RAM when switching configurations. 


On-chip memory is configured by a reset or by the CNFD and CNFP in- 
structions. Block BO is configured as data memory by executing CNFD or re- 
set. A CNFP instruction configures block BO as program memory. 
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PROGRAM DATA MEMORY 
BUS BUS LOCATIONS 
MEMORY-MAPPED 


DATA 0-5 
REGISTERS (>0000->0005) 


me 
ees 


aN 


DATA 96-127 
(>0060->007F) 


3 BLOCK B2 


S 
se 


ASRS 


DATA 512-767 
(>0200->02FF) 


3 BLOCK BO 
s BLOCK B1 


PROGRAM DATA MEMORY 
BUS BUS LOCATIONS 


DATA 768-1023 
(>0300->03FF) 


MEMORY-MAPPED 
DATA 0-5 
REGISTERS (>0000->0005) 


. BLOCK B2 DATA 96-127 


BLOCK BO PROG 65280-65535 
(>FFOO->FFFF) 


- | DATA 768-1023 
BLOCK Bi (>0300—>03FF) 


Figure 5-1. On-Chip RAM Configurations 


Configuring block BO as program memory is useful for implementing adaptive 
filters or similar applications at full speed with only on-chip memories. Exam- 
ple 5-20 illustrates the use of the configuration modes to utilize block BO as 
data and program memory while executing from on-chip program ROM. 
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Example 5-20. Configuring and Using On-Chip RAM 


TITL 'ADAPTIVE FILTER' 


DEF ADPFIR 
DEF X,Y 


+e + H 


* 


COEFFP EQU >FFOO 
COEFFD EQU >0200 
ONE EQU >7A 
BETA EQU >7B 
ERR EQU >7C 
ERRF EQU >7D 

Y EQU >7E 

x EQU >7F 
FRSTAP EQU >0380 
LASTAP EQU >0O3FF 
* 


* FINITE IMPULSE RESPONSE 
* 
ADPFIR CNFP 
; MPYK 0 
LAC ONE,14 
LARP AR3 
LRLK AR3,LASTAP 
FIR RPTK 127 
MACD COEFFP, *- 
CNFD 
APAC 
SACH Y,1 
NEG 
ADD xX,15 
SACH ERR,1 
* 


* LMS ADAPTATION OF FILTER 
* 


LRLK AR2,COEFFD 
LRLK AR3,LASTAP 
DMOV X 

LT ERRE . 

MPY *-,AR2 


ADAPT ZALH *,AR3 
ADD ONE,15 


* MPY *-,AR2 
SACH *+,0,AR1 
BANZ ADAPT, *-,AR2 


BO PROGRAM MEMORY ADDRESS 
BO DATA MEMORY ADDRESS 


CONSTANT ONE (DP=6) 
ADAPTATION CONSTANT (DP=6) 
SIGNAL ERROR (DP=6) 
ERROR FUNCTION (DP=6) 
FILTER OUTPUT (DP=6) 


NEWEST DATA SAMPLE (DP=6) 
NEXT NEWEST DATA SAMPLE 
OLDEST DATA SAMPLE 


me we Ne Ne Ne te Ne Ne Ne ON 


(FIR) FILTER. 


CONFIGURE BO AS PROGRAM: 
Clear the P register. 
Load output rounding bit. 


se oNe Ne 


Point to the oldest sample. 


~~ 


128-tap FIR filter. 
CONFIGURE BO AS DATA: 


=e Ne 


Store the filter output. 


~e 


Add the newest input. 
err(n) = x(n) - y(n) 


=e 


“e 


COEFFICIENTS. 


128-TAP FIR FILTER. 
errf(n) = beta * err(n) 
ROUND THE RESULT. 


we Ne Ne 


128 COEFFICIENTS TO UPDATE. 
POINT TO THE COEFFICIENTS. 

POINT TO THE DATA SAMPLES. 

INCLUDE NEWEST SAMPLE. 


ue Ne Ne ON 


P = 2*beta*err(n)*x(n-k) 


~e 


LOAD ACCH WITH ak(n). 
LOAD ROUNDING BIT. 

ak(nt+1) = ak(n) + B 

P = 2*beta*err(n)*x(n-k) 
STORE ak(nt+l). 

END OF LOOP TEST. 

RETURN TO CALLING ROUTINE. 


wee Ne Ne Ne me Ne 


THIS 128-TAP ADAPTIVE FIR FILTER USES ON-CHIP MEMORY BLOCK 
BO FOR COEFFICIENTS AND BLOCK Bl FOR DATA SAMPLES. THE 
NEWEST INPUT SHOULD BE IN MEMORY LOCATION X WHEN CALLED. 
THE OUTPUT WILL BE IN MEMORY LOCATION Y WHEN RETURNED. 
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Note that a more definitive example of the use of the TMS320C25 for adaptive 


filtering is provided in Section 5.7.3. 


5.4.3 Using On-Chip RAM for Program Execution 
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In using on-chip memory (block BO) for program execution, this memory must 
first be loaded with executable code from external memories while configured 
as data memory. On-chip execution is initiated by using the CNFP instruction 
to reconfigure block BO as program memory and performing a branch or call 
to an on-chip RAM address. By configuring block BO as program memory and 
executing from this internal memory, full-speed execution can be achieved in 
systems using slower external memory. Example 5-21 illustrates how a pro- 
gram may be written to be loaded into and executed from on-chip memory. 


One group of instructions, the branch/call instructions, are impacted by the 
location of execution. Normally, by using labels, the assembler properly de- 
termines the location to which a branch is taken. Since the code is relocated 
prior to execution from on-chip memory, it is necessary to alter the address 
determined by the assembler for branch instructions. This alteration is neces- 
sary so that the branch address that is determined can be consistent with the 
address space used during execution. In Example 5-21, this is accomplished 
by adding an offset value (OFFSET) to the branch label representing the des- 
tination address in the operand field for each branch instruction. The offset 
address is determined by use of an EQU (equate) directive that subtracts the 
assembler location of the code to be relocated (equivalent to base-O address- 
ing) from the base address of the relocation address (internal block BO ad- 
dress in this case). 
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Example 5-21. Program Execution from On-Chip Memory 


AORG O 
RESET B INIT 
* 


* BRANCHES FOR EXTERNAL OR INTERNAL INTERRUPTS FOLLOW HERE AT 
* THE DESIGNATED LOCATIONS AS REQUIRED. 
* . 


AORG >20 
* 
* A BRANCH INSTRUCTION AT PROGRAM MEMORY LOCATION O DIRECTS 
* PROCESSOR EXECUTION HERE. 
* 


* INITIALIZE THE PROCESSOR. 
* 


INIT ROVM DISABLE OVERFLOW MODE. 


SSXM SET SIGN EXTENSION. 

LDPK O POINT DP REGISTER TO DATA MEMORY PAGE O. 
SPM 0O NO SHIFT ON PRODUCT REGISTER OUTPUT. 
LARP AR4 USE AUXILIARY REGISTER 4 (SET ARP = 4). 


POINT AR4 TO PERIOD. REGISTER. 
SET ACCUMULATOR TO >FFFF. 


LARK AR4,PRD 
LALK OFFFF 


Se ee er Te Se Te Te TY 


SACL *+ LOAD PERIOD REGISTER WITH MAXIMUM VALUE. 
SACL *+ ENABLE ALL INTERRUPTS VIA IMR. 

ZAC CLEAR ACCUMULATOR. 

SACH * CLEAR GREG TO MAKE ALL MEMORY LOCAL. 


* LOAD TIME-CRITICAL CODE FROM EXTERNAL SLOW MEMORY TO INTERNAL RAM. 


LARP AR1 

LRLK AR1,BLKO 
RPTK PROGL-1 
BLKP PROG, *+ 


USE AUXILIARY REGISTER 1 (SET ARP = 1). 
POINT AR1 TO RECONFIGURABLE BLOCK BO. 
LOAD REPEAT COUNTER WITH BLOCK LENGTH. 
MOVE CODE FROM PROG MEMORY TO ON-CHIP RAM. 


we Ne Ne Ne 


* 


* INITIALIZE PARAMETERS FOR EXECUTION. 


LDPK 6 ; POINT DP REGISTER TO DATA MEMORY PAGE 6. 
LACK 1. ; SET ACCUMULATOR TO >OO01. 
SACL ONE ; STORE VALUE OF 1. 


LRLK AR1,BLKO+PROGL ; POINT AR1 TO INTERNAL MEMORY ADDRESS. 
RPTK COEFL-1 LOAD REPEAT COUNTER WITH BLOCK LENGTH. 
BLKP COEF, *+ MOVE DATA FROM PROG MEMORY TO ON-CHIP RAM. 


Se en eT 


CNFP CONFIGURE BLOCK BO AS PROGRAM MEMORY. 
LALK >FFOO LOAD ACC WITH PROG ADDR IN INTERNAL RAM. 
BACC BRANCH TO ON-CHIP EXECUTION ADDRESS. 


* 


* SIGNAL PROCESSING CODE TO BE EXECUTED FROM ON-CHIP RAM. 
* 
PROG EQU $ 
LPTS BIOZ GET-PROG+ONCHIP 
B LPTS-PROG+ONCHIP 
GET OUT FILOUT,PA2 
IN  FILIN,PA2 
LRLK AR1,BLK1+SIGNAL 
ZAC 
MPYK 0 


WAIT FOR INPUT SIGNAL. 

BRANCH IF NO SIGNAL. 

OUTPUT LAST FILTER OUTPUT. 

INPUT NEW SIGNAL SAMPLE. 

POINT AR1 TO SIGNAL DATA TO PROCESS. 
CLEAR THE ACCUMULATOR. 

CLEAR THE P REGISTER. 


wee Ne Ne Ne Ne Ne 
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PROGE 
PROGL 
ONCHIP 
* 


RPTK 
MACD 
APAC 
SACH 
B 
EQU 
EQU 
EQU 


£5 


ONCHIP+COEFF , *- 


FILOUT,1 
LPTS-PROG+ONCHIP 


$ 


PROGE-PROG 
>FFOO 


REPEAT MACD INSTRUCTION FOR 16 TAPS. 
MULTIPLY, ACCUMULATE, SAMPLE DELAY. 
ACCUMULATE THE LAST PRODUCT. 

SAVE THE RESULT. 

LOOP TO WAIT FOR NEXT SAMPLE. 

SAVE THE RESULT. 

PROGRAM CODE LENGTH. 

BASE ADDRESS EXECUTION. 


we me Me Ne Me Ne Ne we 


* COEFFICIENT DATA TO BE LOADED INTO ON~CHIP RAM. 
* 


COEF 


COEFE 


COEFL 
* 


DATA 
DATA 
DATA 
DATA 
EQU 
EQU 


385,-1196,1839,-2009 
1390,407,-4403,19958 
19958,-4403 ,407,1390 
-2009,1839,-1196,385 


COEFE-COEF 


* INTERNAL MEMORY CONSTANTS. 
* 


BLKO 
BLK1 
* 


* DATA 
* 


DRR 
DXR 
TIM 
PRD 
IMR 
GREG 
* 


* DATA 
* 

BO 
COEFF 
* 

* DATA 
* 

ONE 
FILOUT 


FILIN 
SIGNAL 


5-30 


EQU 
EQU 


PAGE 


DORG 
BSS 
BSS 
BSS 
BSS 
BSS 
BSS 


PAGE 


DORG 
BSS 
BSS 


PAGE 


DORG 
BSS 
BSS 
BSS 
BES 
END 


ae 


; COEFFICIENT DATA LENGTH. 


>200 
>300 
0 (BLOCK B2) - DATA MEMORY LABELS. 
6) 
1 ; SERIAL PORT DATA RECEIVE REGISTER. 
RE ; SERIAL PORT DATA TRANSMIT REGISTER. 
1 > TIMER REGISTER. 
1 ; PERIOD REGISTER. 
1 ; INTERRUPT MASK REGISTER. 
1 ; GLOBAL MEMORY ALLOCATION REGISTER. 
4 (BLOCK BO) - DATA MEMORY LABELS. 
0 
PROGL ; LOCATIONS FOR INTERNAL PROGRAM CODE. 
COEFL LOCATIONS FOR COEFFICIENT MEMORY. 
6 (BLOCK Bl) - DATA MEMORY LABELS. 
¢) 
; RESERVED FOR DATA VALUE OF 1. 
1 ; FILTER OUTPUT SIGNAL VALUE. 
L ; FILTER INPUT SIGNAL VALUE. 
14 ; LAST SIGNAL DELAY VALUE. 
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5.5 Fundamental Logical and Arithmetic Operations 


Although the TMS320C2x instruction set is oriented toward digital signal 
processing, the same fundamental operations of a general-purpose processor 
are included. This section explains basic operations of the TMS320C2x Cen- 
tral Arithmetic Logic Unit (CALU), particularly accumulator operations, the 
status register effect on data processing, and bit manipulation. 


The TMS320C2x provides a complete set of logical operations, including 
AND, OR, XOR, and CMPL (complement) instructions. This enables the de- 
vice to perform any !ogical function. These instructions may be used to per- 
form sign magnitude to two’s complement or the reverse conversions. 


The contents of the accumulator may be stored in data memory using the 
SACH and SACL instructions or stored in the stack by using the PUSH in- 
struction. The accumulator may be loaded from data memory using the ZALH 
and ZALS instructions, which zero the accumulator before loading the data 
value. The ZAC instruction zeroes the accumulator. POP can be used to re- 
store the accumulator contents from the stack. 


The accumulator is also affected by the ABS and NEG instructions. ABS re- 

. places the contents of the accumulator with the absolute value of its contents. 
NEG generates the arithmetic complement of the accumulator in two’s- 
complement form. 


5.5.1 Status Register Effect on Data Processing 


Three data processing options allow the ALU to automatically suppress sign 
extension, manage overflow, or scale product accumulations. These options 
are enabled or disabled through bits in the status registers. These options 
function in parallel with normal execution of the instructions and cause no 
additional machine cycles, therefore no performance overhead. 


The sign-extension mode option is used to determine whether or not the 
shifted data values fetched for ALU operations should be sign-extended. The 
SXM status bit controls this operation. This bit is set to ‘1’ for enabling sign 
extension using the SSXM instruction, and set to ‘0’ for suppressing sign ex- 
tension using the RSXM instruction. This operation affects all the instructions 
that include a shift of the incoming data value (i.e. ADD, ADDT, ADLK, LAC, 
LACT, LALK, SBLK, SFR, SUB, and SUBT). 


The overflow mode option is uscd to minimize the effects of an arithmetic 
overflow by forcing the accumulzior to saturate at the largest positive value 
(or in the case of underflow, the iargest negative value). The OVM status bit 
controls this operation. The overflow mode is enabled by setting the OVM bit 
to a ‘1’ using the SOVM instruction, and reset using the ROVM instruction. 
This feature affects all arithmetic operations in the ALU. 


The product register shift msde option forces all products to be shifted before 
they are accumulated. The products can be left-shifted one bit to delete the 
extra sign bit in the multiply of two 16-bit signed numbers. The products can 
be left-shifted four bits to delete the extra sign bits in multiplying a 16-bit data 
value by a 13-bit constant. Tne product shifter can also be used to shift all 
products six bits to the right ‘9 allow up to 128 product accumulations with- 
out the threat of an arithmetic. overflow, thereby avoiding the overhead of ov- 
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erflow management. The shifter can be disabled to cause no shift in the 
product when working with integer or 32-bit precision operations. This also 
maintains compatibility with TMS320C1x code. These operations are con- 
trolled by the value contained in the PM bits of status register ST1. The PM 
bits are set using the SPM instruction. This feature affects all the instructions 
that use the product of the multiplier (i.e, APAC, LTA, LTD, LTP, LTS, MAC, 
MACD, MPYA, MPYS, PAC, SPAC, SPH, SPL, SQRA, and SQRS). 


5.5.2 Bit Manipulation 


The BIT instruction tests any of the 16 bits of the addressed data word. The 
specified bit is copied into the TC of the status register. The bit tested is 
specified by a bit code in the opcode of the instruction. Either the BBZ 
(branch on TC bit = 0) or BBNZ (branch on TC bit = 1) instructions check 
the bit and allow branching to a service routine. 


Bit testing is useful in control applications where a number of states or con- 
ditions may be latched externally and read into the TMS320C2x via an IN in- 
struction. At this point, individual bits can be tested and branches taken for 
appropriate processing. 


Since the BIT instruction requires the bit code to be specified with the in- 
struction, it cannot be placed in a loop to test several different bits of a data 
word or bits determined by prior processing for efficient use. The TMS320C2x 
also has a BITT instruction in which the bit code is specified in the T register. 
Since the T register can easily be modified, BITT may be used to test all bits 
of a data word if placed within a loop or to test a bit location determined by 
past processing. 


Example 5-22. Using BIT and BBZ 
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* THIS ROUTINE USES THE BIT INSTRUCTION TO TEST THE CONDITION 
* OF AN EXTERNAL MUX. BIT 4 DETERMINES THE UTILITY OF THE 
* REMAINING DATA. IF ZERO, A COUNTER IS INCREMENTED. IF ONE, 
* ADDITIONAL PROCESSING OCCURS AND THE COUNTER IS CLEARED. 
* THE ROUTINE IS INVOKED WHENEVER A TIMER INTERRUPT OCCURS. 
* 
TIME SST STO ; SAVE STATUS REGISTER STO. 

LDPK 0 

LARP AR3 

IN DAT,PAS j; READ IN VALUE. 

BIT DAT,>B ; TEST BIT 4. 

BBZ INCR ; BRANCH AND INCREMENT IF POSITIVE. 

LARK AR3,0 ; CLEAR THE COUNTER. 

LST STO ; RELOAD THE STATUS REGISTER. 

EINT ; ENABLE INTERRUPTS. 

RET ; RETURN TO INTERRUPTED ROUTINE. 
* 
INCR MAR *+ ; INCREMENT THE COUNTER. 

LST STO ; RELOAD THE STATUS REGISTER. 

EINT ; ENABLE INTERRUPTS. 

RET ; RETURN TO INTERRUPTED ROUTINE. 
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Example 5-23. Using BITT and BBNZ 


INTERRUPT 


+e + + H+ H+ 


TIME SST 
LDPK 
LARP 
LAR 
LRLK 
IN 
B 

TMLOOP LT 
BITT 
BBNZ 


LTEST BANZ 


OCCURS. 


STO ; 
0 

AR3 

AR3,BCNT ; 
AR3,BTBL ; 
DAT,PA8 ; 
LTEST,*-,4 
*+,3 7 
DAT : ; 
LTEST ; 


TMLOOP,*-,4 
STO 


~e me Ne 


SAVE 


LOAD 
LOAD 
READ 


LOAD 
TEST 


STATUS REGISTER STO. 


COUNT OF ACTIVE BITS. 
THE BIT TABLE ADDRESS. 
IN VALUE. 


BIT CODE. 
SPECIFIED BIT. 


BRANCH IF BIT IS ONE. 


RELOAD THE STATUS REGISTER. 
ENABLE INTERRUPTS. 


RETURN TO INTERRUPTED ROUTINE. 


THIS ROUTINE USES THE BITT INSTRUCTION TO TEST THE CONDITION 
OF AN EXTERNAL MUX. A BIT IN THE MUX IS SIGNIFICANT ONLY 
WHEN PRIOR PROCESSING HAS DESIGNATED THE BIT TO BE ACTIVE. 
INDIVIDUAL PROCESSING WILL TAKE PLACE BASED UPON THE STATE 
OF THE TESTED BIT. THE BITS ARE TESTED EACH TIME A TIMER 
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5.6 Advanced Arithmetic Operations 


The TMS320C2x provides special instructions that facilitate efficient exe- 
cution of arithmetic-intensive DSP algorithms, such as MACD, SQRA, SUBC, 
and NORM. Explanations and examples of how to use these instructions with 
overflow management, and for data moves, multiplications, division, float- 
ing-point arithmetic, indexed addressing, and extended-precision arithmetic 
are included in this section. 


5.6.1 Overflow Management 
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The TMS320C25 has four features that can be used to handle overflow man- 
agement. These include the branch on overflow conditions, accumulator sat- 
uration (overflow mode), product register right shift, and accumulator right 
shift. These features provide several options for overflow protection within an 
algorithm. 


A program can branch to an error handler routine on an overflow of the ac- 
cumulator by using the BV (branch on overflow) instruction or bypass an error 
handler by using the BNV (branch if no overflow) instruction. These in- 
structions can be performed after any ALU operation that may cause an ac- 
cumulator overflow. 


The overflow mode is a feature useful for DSP applications. This mode sim- 
ulates the saturation effect characteristic of analog systems. When enabled, 
any overflow in the accumulator results in the accumulator contents being 
replaced with the largest positive value (>7FFFFFFF) if the overflowed num- 
ber is positive, or the largest negative value (>80000000) if negative. The 
overflow mode is controlled by the OVM bit of status register STO and can be 
changed by the SOVM (set overflow mode), ROVM (reset overflow mode), 
or LST (load status register) instructions. Overflows can be detected in soft- 
ware by testing the OV (overflow) bit in status register STO. When a branch 
is used to test the overflow bit, OV is automatically reset. Note that the OV 
bit does not function as a carry bit. It is set only when the absolute value of a 
number is too large to be represented in the accumulator, and it is not reset 
except by specific instructions. 


Another method of overflow management, which applies to multiply-accu- 
mulate operations, is the use of the right shifter of the product register. The 
right shifter, which operates with no cycle overhead, allows up to 128 accu- 
mulations without the possibility of an overflow. The least-significant six bits 
of the product are lost, and the MSBs are filled with sign bits. This feature is 
initiated by setting the PM bits of status register ST1 to ‘11’ using the SPM 
or LST1 instructions. 


The TMS320C2x also has a right shift of the accumulator (using the SFR in- 
struction) to scale down the accumulator when it nears overflow. 
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5.6.2 Scaling 


Scaling the data coming into the accumulator or already in the accumulator is 
useful in signal processing algorithms. This is frequently necessary in adapta- 
tion or other algorithms that must compute and apply correction factors or 
normalize intermediate results. Scaling and normalizing are implemented on 
the TMS320C2x via right and left shifts in the accumulator and shifts of data 
on the incoming path to the accumulator. 


Right and left shifts of the accumulator can be performed using the SFL and 
SFR instructions. SFL performs a logical left shift. SFR performs logical or 
arithmetic right shifts depending on the state of the SXM bit in the status 
register. A one in the SXM bit, corresponding to sign-extension enabled, 
causes an arithmetic shift to be performed. 


In addition to the shift instructions, data can be left-shifted O to 15 bits when 
the accumulator is loaded using a LAC instruction, and left-shifted 0, 1, or 4 
bits on the TMS32020 or 0 to 7 bits on the TMS320C25 when storing from 
the accumulator using SACH or SACL instructions. These shifts can be used 
for loading numbers into the high 16 bits of the accumulator and renormaliz- 
ing the result of a multiply. The incoming left shift of 0 to 15 bits can be 
supplied in the instruction itself or can be taken from the lowest four bits of 
the T register. Left shifts of data fetched from data memory are available for 
loading the accumulator (LAC/LACT), adding to the accumulator 
(ADD/ADDT), and subtracting from the accumulator (SUB/SUBT). The 
contents of the P register may also be shifted prior to accumulation. 


5.6.3 Moving Data 


Many DSP applications must perform convolution operations or other oper- 
ations similar in form. These operations require data to be shifted or delayed. 
The DMOV, LTD, and MACD instructions can perform the needed data moves 
for convolution. 


The data move function allows a word to be copied from the currently ad- 
‘dressed data memory location in on-chip RAM to the next higher location 
while the data from the addressed location is being operated upon (e.g., by 
the CALU). The data move and the CALU operation are performed in the same 
cycle. In addition, an ARAU operation may also be performed in the same cy- 
cle when using the indirect addressing mode. The data move function is useful 
in implementing algorithms, such as convolutions and digital filtering, where 
data is being passed through a time window. It models the z™ ' delay operation 
encountered in those applications. The data move function is continuous 
across the boundary of the on-chip data memory blocks BO, B1, and B2. 
However, the data move function cannot be used if off-chip memory is refer- 
enced. 


In Example 5-24, the following equation is implemented: 


2 
Y(n) =D) Hk) X(n-k) 
k=0 
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where the H values stay the same, and the X values are shifted each time the 
microprocessor performs one of the following series of multiplications (similar 
to operations performed in FIR filters): 


First Series: Y¥(2) = (HO)(X2)+(H1)(X1)+(H2) (XO) 
Second Series: Y(3) = (HO)(X3)+(H1)(X2)+(H2)(X1) 
Third Series: Y(4) = (HO)(X4)+(H1)(X3)+(H2)(X2) 


The MACD instruction, which combines accumulate and multiply operations 
with a data move, is tailored to the type of calculation shown in the summa- 
tion equation above. In order to use MACD, the H values have been stored 
in block BO, configured as program RAM, and the X values have been read 
into block B1 of data RAM as shown in Figure 5-2. 


PROGRAM DATA 
BLOCK BO BLOCK B1 


>FFOO »>300 
>FFO1 >301 


>FEO2 302 | x0 |e— ARI 
(COEFFICIENTS) (SAMPLES) 


Figure 5-2. MACD Operation 


Also in Example 5-24, the summation in the above equation is performed in 
the reverse order, i.e., from K = 2 to 0, due to the operation of the data move 
function. This results in the oldest X value being used and discarded first. 


If the MACD instruction is replaced with the following two instructions, then 
the MAC instruction can be utilized with the same results. 


MAC 7 
DMOV *- 


In cases where many more than three MACD instructions are required, the 
RPT or RPTK instructions may be used with MACD, yielding the same com- 
putational results but using less assembly code. 
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Example 5-24. Using MACD for Moving Data 


THIS ROUTINE IMPLEMENTS A SINGLE PASS OF A THIRD-ORDER FIR 
FILTER. IT IS ASSUMED THAT THE H AND X VALUES HAVE ALREADY 
BEEN LOADED INTO THEIR RESPECTIVE MEMORY LOCATIONS, THAT 
THE ACCUMULATOR AND P REGISTER ARE BOTH RESET TO ZERO, AND 
THAT AR1 IS POINTING AT XO. NOTE THAT THE MACD INSTRUCTION 
MAY BE USED IN THE REPEAT MODE, BUT IT IS NOT IMPLEMENTED 
HERE. 


+ + Fe HF FH eH 


CONFIGURE BLOCK BO AS PROGRAM MEMORY. 
AR1 SHOULD POINT AT THE X VALUES. 

P = (X0)(H2) 

ACC = (X0) (H2) 


FIR CNFP 
LARP 1 
MAC >FFOO,*- 
MACD >FFO1,*- 


wee Ne me Ne we Ne Ne 


MACD >FFO2,* ACC = (X0O)(H2) + (X1)(H1) 

APAC ACC = (XO) (H2) + (X1)(H1) + (X2) (HO) 
CNFD CONFIGURE BLOCK BO AS DATA MEMORY. 
RET RETURN TO MAIN PROGRAM. 


5.6.4 Multiplication 


The TMS320C2x hardware multiplier normally performs two’s-complement 
16-bit by 16-bit multiplies and produces a 32-bit result in one processor cycle. 
A single TMS320C25 instruction, MPYU, can.be used to multiply two 16-bit 
unsigned numbers. To multiply two operands, one operand must be loaded 
into the T register (TR). The second operand is moved by the multiply in- 
struction to the multiplier, which then produces the product in the P register 
(PR). Before another multiply can be performed, the contents of the PR must 
be moved to the accumulator. A single-multiply program is shown in Example 
5-25. By pipelining multiplies and PR moves, most multiply operations can 
be performed with a single instruction. 


A common operation in DSP algorithms is the summation of products. The 
MAC instruction, normally performed in multiple cycles, adds the contents of 
the PR to the accumulator and then simultaneously reads two values and 
multiplies them. When using the MAC instruction, a data memory value is 
multiplied by a program memory value. One of the operands can come from 
block B1 or B2 in on-chip data memory while the other operand may come 
from block BO. Block BO must be configured as program memory when it 
supplies the second operand. Pipelining of the MAC instruction with a repeat 
instruction results in an execution time for each succeeding multiply-and-ac- 
cumulate operation of only one cycle. 
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Example 5-25. Multiply 


* THIS ROUTINE MULTIPLIES TWO VALUES IN DATA MEMORY LOCATIONS 
* >200 AND >201 WITH THE RESULT STORED IN >202 AND >203. 
* 


MUL LRLK AR1,>200 POINT AT BLOCK BO. 


~ 


LARP 1 

LT * + ; GET FIRST VALUE AT >200. 
MPY *+ ; MULTIPLY BY VALUE AT >201. 
PAC ; PUT RESULT IN ACCUMULATOR. 
SACL *+ ; STORE LOW WORD AT >202. 
SACH * ; STORE HIGH WORD AT >203. 
RET 7 RETURN TO MAIN PROGRAM. 


The pipelining of the MAC and MACD. instructions incurs a certain amount 
of overhead in execution. In those cases where speed is more critical than 
program memory, it may be beneficial to use LTA or LTD and MPY in- 
structions rather than MAC or MACD. Example 5-26 and Example 5-27 show 
an implementation of multiply-accumulates using the MAC instruction. Ex- 
ample 5-28 shows an implementation of multiply-accumulates using the 
LTA-MPY instruction pair. Figure 5-3, Figure 5-4, and Figure 5-5 provide 
graphically the information necessary to determine the efficiency of use for 
each of the techniques. 


Example 5-26. Multiply-Accumulate Using the MAC Instruction (TMS32020) 


* CLOCK TOTAL CLOCK PROGRAM TOTAL PROGRAM 
* CYCLES CYCLES MEMORY MEMORY 
* 

LARP AR1 % 1 

LRLK AR1,>300 2 2 

CNFP 1 iq 

ZAC 1 1 

MPYK 0 1 if 

RPTK N-1 1 1 

MAC >FFOO,*+ 2 +N 2 

APAC 1 10 + N 1 10 


Example 5-27. Multiply-Accumulate Using the MAC Instruction (TMS320C25) 
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* CLOCK TOTAL CLOCK PROGRAM TOTAL PROGRAM 
* CYCLES CYCLES MEMORY MEMORY 
* 

LARP AR1 i 1 

LRLK AR1,>300 2 2 

CNFP 1 1 

ZAC 1 it 

MPYK 0 1 if 

RPTK N-1_ 1 1 

MAC >FFOO,*+ 3 +N 2 

APAC 1 11+N 1 10 
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Example 5-28. Multiply-Accumulate Using the LTA-MPY Instruction Pair 


* CLOCK TOTAL CLOCK PROGRAM TOTAL PROGRAM 
* CYCLES CYCLES MEMORY MEMORY 
* 

ZAC 1 1 

LT. pL z 1 

MPY Cl 1 1 

LTA D2 1 1 

MPY C2 1 1 

. 2N 2N 

LTA DN i 1 

MPY CN 1 a 

APAC a 2 + 2N 1 2 + 2N 


EXECUTION IN CLOCK CYCLES 


so eee 


"| @= MAC IMPLEMENTATION. 
.. | = LTACMPY IMPLEMENTATION 
: x= BREAK-EVEN POINT 


123 4 5 6 7 8 9 10 11 


NUMBER OF MULTIPLY-ACCUMULATES TO BE 
PERFORMED 


Figure 5-3. Execution Time vs. Number of Multiply-Accumulates 
(TMS32020) 
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EXECUTION IN CLOCK CYCLES 


“Pe = MAC IMPLEMENTATION 
. | O= LTA-MPY IMPLEMENTATION 
' x= BREAK-EVEN POINT 


12 3 4 5 6 7 8 9 70 11 


NUMBER OF MULTIPLY-ACCUMULATES TO BE 
PERFORMED 


Figure 5-4. Execution Time vs. Number of Multiply-Accumulates 
(TMS320C25) 
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PROGRAM MEMORY IN WORDS 


MAC IMPLEMENTATION 


: :-@ecrt 
z+: O = LTA-MPY IMPLEMENTATION 
: X = BREAK-EVEN POINT 


12 3 4 5 6 7 8 g 10 11 


NUMBER OF MULTIPLY-ACCUMULATES TO BE 
PERFORMED 


Figure 5-5. Program Memory vs. Number of Multiply-Accumulates 


In numerical analysis, it is often necessary to square numbers along with 
adding or subtracting. The TMS320C2x has two instructions, SQRA and 
SORS, that accomplish this in a single machine cycle. The result of the pre- 
vious operation in the PR is first added to the accumulator if SQRA is used, 
or subtracted from the accumulator if SORS is used. Then the data value ad- 
dressed is squared, and the result is stored in the PR. Example 5-29 uses the 
SQRA instruction to perform the computation. 
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Example 5-29. Using SORA 


+ + + HF HF 


THIS ROUTINE USES THE SQRA INSTRUCTION TO COMPUTE THE 
SQUARE OF THE DISTANCE BETWEEN TWO POINTS WHERE D**2 
IS DEFINED AS FOLLOWS: 

D**2 = (XA - XB)**2 + (YA - YB)**2 


DIST LAC XA 


SUB XB 
SACL XT ; XT = XA - XB 
* 
LAC YA 
SUB YB 
SACL YT ; YT = YA - YB 
* 
SQRA XT ; (P) = XT**2 
ZAC ; (ACC) = 0 
SQRA YT ; (P) = YT**2, (ACC) = XT**2 
APAC ; (ACC) = XT**2 + ¥T**2 = D**2 
* 
RET ; RETURN TO MAIN PROGRAM. 


When performing multiply-and-accumulate operations, it may be desirable to 
shift the product before adding it to the accumulator. This can be accom- 
plished simultaneously with the MAC instruction by using the product shift 
mode on the TMS320C25. This mode, controlled by two bits in the PM field 
of status register ST1, shifts the value from the PR while it is transferred to the 
accumulator. The contents of the PR are not shifted. 


5.6.5 Division 
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Division is implemented on the TMS320C2x by repeated subtractions using 
SUBC, a special conditional subtract instruction. Given a 16-bit positive div- 
idend and divisor, the repetition of the SUBC command 16 times produces a 
16-bit quotient in the low accumulator and a 16-bit remainder in the high 
accumulator. 


SUBC implements binary division in the same manner as is commonly done 
in long division. The dividend is shifted until subtracting the divisor no longer 
produces a negative result. For each subtract that does not produce a negative 
answer, a one is put in the LSB of the quotient and then shifted. The shifting 
of the remainder and quotient after each subtract produces the separation of 
the quotient and remainder in the low and high halves of the accumulator. 


There are similarities between long division and the SUBC method of division. 
Both methods are used to divide 33 by 5. 


ee —— — — 
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LONG DIVISION: 
0000000000000110 Quotient 
0000000000000101 
-101 
110 
-104 
11 Remainder 
SUBC METHOD: 
32 HIGH ACC | LOW ACC s COMMENT 
ooo0d00c0000000000 0000000000100001 (1) Dividend js loaded into ACC. The 
-10 1000000000000000 divisor is left-shifted 15 and sub- 
~10  0111111111011111 tracted from ACC. The subtraction 


is negative, so discard the result 
| and shift left the ACC one bit. 


coooo0000000000G 0000000001000010 (2) 2nd subtract produces negative 
-10 1000000000000000 answer, so discard result and shift 
= TT 0 ACC (dividend) left. 


e @ 
e e 
e e 
| 
0000000000000100 0010000000000000 (14) 14th SUBC command. The result 
-10 1000000000000000 Is positive. Shift result left and 
oooo000000000001 1010000000000000 replace LSB with ‘1’. 


| | 
0000000000000011 0100000000000001 (15) Result is again positive. Shift 
-10 1000000000000000 result left and replace LSB with '1’. 


0000000000000000 1100000000000001 


| | | 


0000000000000001 1000000000000011 (16) Last subtract. Negative answer, so 


-10 1000000000000000 discard result and shift ACC left. 
STATA TOT 
0000000000000011 0000000000000110 Answer reached after 16 SUBC 


instructions. 


REMAINDER QUOTIENT 


The condition of the divisor, less than the shifted dividend, is determined by 
the sign of the result, both the dividend and divisor must be positive when 
using the SUBC command. Thus, the sign of the quotient must be determined 
and the quotient computed using the absolute value of the dividend and di- 
visor. 


Integer and fractional division can be implemented with the SUBC instruction 
as shown in Example 5-30 and Example 5-31, respectively. When imple- 
menting a divide algorithm, it is important to know if the quotient can be re- 
presented as a fraction and the degree of accuracy to which the quotient is to 
be computed. For integer division, the absolute value of the numerator must 
be greater than the absolute value of the denominator. For fractional division, 
the absolute value of the numerator must be less than the absolute value of 
the denominator. 
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Example 5-30. Using SUBC for Integer Division 
* THIS ROUTINE IMPLEMENTS INTEGER DIVISION. 
* 


DN1 LT NUMERA ; GET SIGN OF QUOTIENT. 
MPY DENOM 


PAC 

SACH TEMSGN ; SAVE SIGN OF QUOTIENT. 
LAC DENOM 

ABS 


SACL DENOM 
LAC NUMERA 
ABS 


MAKE DENOMINATOR POSITIVE. 
ALIGN NUMERATOR. 


we Ne 


IF DIVISOR AND DIVIDEND ARE ALIGNED, DIVISION CAN START 
HERE. 


+ ee Ft 


RPTK 15 
SUBC DENOM ; 16-CYCLE DIVIDE LOOP. 
SACL QUOT 
LAC TEMSGN 
BGEZ DONE 
ZAC 
SUB QuoT 
SACL QUOT 

DONE LAC QUOT 
RET 


DONE IF SIGN IS POSITIVE. 


~~ 


NEGATE QUOTIENT IF NEGATIVE. 


“ee 


RETURN TO MAIN PROGRAM. 


~~ 


Example 5-31. Using SUBC for Fractional Division 


* THIS ROUTINE IMPLEMENTS FRACTIONAL DIVISION. 

* 

DN1 LT NUMERA ; GET SIGN OF QUOTIENT. 
MPY DENOM 
PAC 
SACH TEMSGN 
LAC DENOM 
ABS 
SACL DENOM 
ZALH NUMERA 
ABS 


~ 


SAVE SIGN OF QUOTIENT. 


MAKE DENOMINATOR POSITIVE. 
ALIGN NUMERATOR. 


we oN 


IF DIVISOR AND DIVIDEND ARE ALIGNED, DIVISION CAN START 
HERE. 


+ + 


RPTK 14 

SUBC DENOM ; 15-CYCLE DIVIDE LOOP. 

SACL QUOT 

LAC TEMSGN 

BGEZ DONE ; DONE IF SIGN IS POSITIVE. 

ZAC 

SUB QUOT 

SACL QUOT ; NEGATE QUOTIENT IF NEGATIVE. 
DONE LAC QUOT 

RET ; RETURN TO MAIN PROGRAM. 
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5.6.6 Floating-Point Arithmetic 


Floating-point numbers are often represented on microprocessors in a two- 
word format of mantissa and exponent. The mantissa is stored in one word. 
The exponent, the second word, indicates how many bit positions from the !eft 
the decimal point is located. If the mantissa is 16 bits, a 4-bit exponent is 
sufficient to express the location of the decimal point. Because of its 16-bit 
word size, the 16/4-bit floating-point format functions most efficiently on the 
TMS320C2x. The theory and implementation of floating-point arithmetic has 
been presented in an application report in the book, Digital Signal Processing 
Applications with the TMS320 Family. 


Operations in the TMS320C2x central ALU are performed in two’s- 
complement fixed-point notation. To implement floating-point arithmetic, 
operands must be converted to fixed point for arithmetic operations, and then 
converted back to floating point. 


Conversion to floating-point notation is performed by normalizing the input 
data (i.e., shifting the MSB of the data word into the MSB of the internal 
memory word). The exponent word then indicates how many shifts are re- 
quired. To multiply two floating-point numbers, the mantissas are multiplied 
and the exponents added. The resulting mantissa must be renormalized. 
(Since the input operands are normalized, no more than one left shift is re-- 
quired to normalize the result.) 


Floating-point addition or subtraction requires shifting the mantissa so that 
the exponents of the two operands match. The difference between the expo- 
nents is used to left-shift the lower power operand before adding. Then, the 
output of the add must be renormalized. 


TMS320C2x instructions useful in floating-point operations are the NORM, 
LACT, ADDT, and SUBT instructions. NORM may be used to convert fixed- 
point numbers to floating-point. LACT may be used to convert back to 
fixed-point numbers. Addition and subtraction can be computed in floating 
point using ADDT and SUBT. 


Example 5-32 and Example 5-33 perform a floating-point multiply on the 
TMS32020 and TMS320C25, respectively. The mantissas are assumed to be 
in Q15 format. Q15, one of the various types of Q format, is a number repre- 
sentation commonly used when performing operations on non-integer num- 
bers. In Q format, the Q number (15 in Q15) denotes how many digits are 
located to the right of the binary point. A 16-bit number in Q15 format, 
therefore, has an assumed binary point immediately to the right of the most 
significant bit. Since the most significant bit constitutes the sign of the num- 
ber, then numbers represented in Q15 may take on values from +1 (repres- 
ented by +0.99997...) to -1. 
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Example 5-32. 
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e + Fe HF HH HH HF HF HK HF HF HF He HF 


Using NORM for Floating-Point Multiply (TMS32020) 


THIS SUBROUTINE PERFORMS A FLOATING-POINT MULTIPLY USING 
THE NORM INSTRUCTION. THE INPUTS AND OUTPUTS ARE OF THE 
FORM: 


C = MC * 2**EC 


SINCE THE MANTISSAS, MA AND MB, ARE NORMALIZED, MC CAN BE 
NORMALIZED WITH A LEFT SHIFT OF EITHER O OR 1 IN THE 
ACCUMULATOR. THE EXPONENT OF THE RESULT IS ADJUSTED 
APPROPRIATELY. FOR EXAMPLE, MULTIPLICATION OF THE TWO 
NUMBERS A AND B, WHERE A = 0.1 * 2**2 AND B = QO.1 * 2**4, 
PROCEEDS AS FOLLOWS: 


B= 0.01 * 2**6 
B= 0.1 * 2**5 (NORMALIZED RESULT) 


MULT LAC EA 


ADD EB ; EC = EXPONENT OF RESULT BEFORE 
SACL EC ; NORMALIZATION. 

LT MA 

MPY MB 

PAC ; (ACC) = MA * MB 

SFL ; TAKES CARE OF REDUNDANT SIGN BIT. 
LARP ARO 

LAR ARO,O ; ARO IS INITIALIZED TO 0. 

NORM ; FINDS MSB AND MODIFIES ARO. 

SACH MC ; MC = MA * MB (NORMALIZED) 

SAR ARO,TMP 

EAC, “2C 

SUB TMP 

SACL EC 

RET ; RETURN TO MAIN PROGRAM. 
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Example 5-33. 


+ + F + e + HF HF HF HK KF eH HF HF H HF 


Using NORM for Floating-Point Multiply (TMS320C25) 


THIS SUBROUTINE PERFORMS A FLOATING-POINT MULTIPLY USING 
THE NORM INSTRUCTION. THE INPUTS AND OUTPUTS ARE OF THE 
FORM: 


C = MC * 2**EC 


SINCE THE MANTISSAS, MA AND MB, ARE NORMALIZED, MC CAN BE 
NORMALIZED WITH A LEFT SHIFT OF EITHER O OR 1 IN THE 
ACCUMULATOR. THE EXPONENT OF THE RESULT IS ADJUSTED 
APPROPRIATELY. FOR EXAMPLE, MULTIPLICATION OF THE TWO 
NUMBERS A AND B, WHERE A = 0.1 * 2**2 AND B = 0.1 * 2%**4, 
PROCEEDS AS FOLLOWS: 


1) A* B= 0.01 * 2**6 

2) A* B= 0.1 * 2**5 (NORMALIZED RESULT) 
MULT LAC EA 

ADD EB ; EC = EXPONENT OF RESULT BEFORE 

SACL EC ; NORMALIZATION. 

LT MA 

MPY MB 

PAC ; (ACC) = MA * MB 

SFL ; TAKES CARE OF REDUNDANT SIGN BIT. 

LARP ARS5 

LAR ARS,EC ; ARS IS.INITIALIZED WITH EC. 


FINDS MSB AND MODIFIES AR5. 


Zz 
Oo 
aw 
= 
* 
| 


MC = MA * MB (NORMALIZED) 


n 
cs) 
Q 
a) 
= 
QD 


RET ; RETURN TO MAIN PROGRAM. 


Floating-point implementation programs often require denormalization as well 
as normalization to return results in a 16-bit format. Example 5-34 and Ex- 
ample 5-35 are tailored for denormalizing numbers that were normalized using 
the NORM instruction. This program assumes that the mantissa is in the ac- 
cumulator and the exponent is in an auxiliary register, which is the format of 
the NORM instruction after execution. 
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Example 5-34. Using LACT for Denormalization (TMS32020) 


* THIS ROUTINE DENORMALIZES NUMBERS NORMALIZED BY THE NORM 
* INSTRUCTION. THE DENORMALIZED NUMBER WILL BE IN THE 


* ACCUMULATOR. 
* 


DENORM LARP 1 ; USE AR1 TO POINT AT BLOCK BO. 
LRLK AR1,>200 
SAR ARO,*+ ; STORE EXPONENT AT >200. 
SACH *- ; STORE MANTISSA AT >201. 


* 


* SUBTRACT EXPONENT FROM 16 TO DETERMINE THE NUMBER OF SHIFTS 
* REQUIRED TO DENORMALIZE. 


LAC * ; LOAD ACCUMULATOR WITH EXPONENT. 
BZ OUT ; CHECK FOR ZERO EXPONENT. 
LACK >10 
SUB * 
SACL * 
LT e+ 
LACT * ; DENORMALIZE NUMBER. 
RET ; RETURN TO MAIN PROGRAM. 
OUT MAR ¥*+ ; POINT TO MANTISSA. 
ZALH * ; LOAD ACCUMULATOR WITH RESULT. 
RET ; RETURN TO MAIN PROGRAM. 


Example 5-35. Using LACT for Denormalization (TMS320C25) 


* THIS ROUTINE DENORMALIZES NUMBERS NORMALIZED BY THE NORM 
* INSTRUCTION (NORM *-), THE DENORMALIZED NUMBER WILL BE IN 
* THE ACCUMULATOR. 
* 
DENORM LARP 1 ; USE AR1 TO POINT AT BLOCK BO. 
LRLK AR1,>200 
SAR AR4,*+ ; STORE EXPONENT AT >200. 
SACH *- ; STORE MANTISSA AT >201. 
* 
LAC * ; LOAD ACCUMULATOR WITH EXPONENT. 
BZ OUT ; CHECK FOR ZERO EXPONENT. 
LT = -*+ 
LACT * ; DENORMALIZE NUMBER. 
RET ; RETURN TO MAIN PROGRAM. 
OUT MAR *+ ; POINT TO MANTISSA. 
ZALH * ; LOAD ACCUMULATOR WITH RESULT. 
RET ; RETURN TO MAIN PROGRAM. 
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5.6.7 Indexed Addressing 


The Auxiliary Register Arithmetic Unit (ARAU) allows the next indirect ad- 
dress to be calculated using increment/decrement calculations or indexed ad- 
dressing in parallel to the current arithmetic operation. For example, in the 
multiplication of two matrices, the operation requires addressing across the 
rows (incrementing the address by one) or down the columns (incrementing 
by n). Example 5-36 gives the code for multiplying a row times a column of 
two 10 x 10 matrices. The first matrix resides in data RAM block B1, and the 
second matrix resides in block BO. 


Example 5-36. Row Times Column 


LARK 0O,>A ; SET INDEX TO 10. 

LARP 1 ; USE AR1 FOR ADDRESSING THE COLUMN. 
LRLK 1,>300 ; POINT AR1 TO THE START OF BLOCK Bl. 
CNFP ; SET BO TO PROG ADDRESS FOR PIPELINE. 
ZAC - ; INITIALIZE THE ACCUMULATOR. 

MPYK 0O ; CLEAR THE PRODUCT REGISTER. 

RPTK 9 ; REPEAT 10 TIMES AS MATRIX DIMENSION. 
MAC >FFOO,*0+ ; MULTIPLY ROW TIMES COLUMN. 

APAC ; EXECUTE FINAL ACCUMULATION. 


ACCUMULATOR CONTAINS PRODUCT. 


The algorithm in Example 5-36 executes in 22 machine cycles. The key to this 
performance is the parallel addressing of both multiplicands simultaneously. 
The operation is made possible by the use of the data bus to fetch one multi- 
plicand and the program bus to fetch the other. The auxiliary register indexes 
down the column of one matrix while the PC generates incremental address- 
ing of each row of the other matrix. Each cycle of the repeat loop performs the 
following operations: 


1) |Accumulates the previous product, 

2)  Miultiplies the row element times the column element, 
3) Increments the row address, and 

4) Indexes the column address. 


5.6.8 Extended-Precision Arithmetic 


Numerical analysis, floating-point computations, or other operations may re- 
quire arithmetic to be executed with more than 32 bits of precision. Since the 
TMS320C2x are 16/32-bit fixed-point processors, software is required for the 
extended-precision of arithmetic operations. Subroutines that perform the 
extended-arithmetic functions for both the TMS32020 and TMS320C25 are 
provided in the examples of this section. The technique consists of performing 
the arithmetic by parts, similar to the way in which longhand arithmetic is 
done. 


The TMS320C25 has two features that help to make extended-precision cal- 
culations more efficient. One of the features is the carry status bit. This bit is 
affected by all arithmetic operations of the accumulator (ABS, ADD, ADDC, 
ADDH, ADDK, ADDS, ADDT, ADLK, APAC, LTA, LTD, LTS, MAC, MACD, 
MPYA, MPYS, NEG, SBLK, SPAC, SORA, SQRS, SUB, SUBB, SUBC, SUBH, 
SUBK, SUBS, and SUBT). The carry bit is also affected by the rotate and shift 
accumulator instructions (ROL, ROR, SFL, and SFR) or may be explicitly 
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modified by the load status register ST1 (LST1), reset carry (RC), and set carry 
(SC) instructions. For proper operation, the overflow mode bit should be reset 
(OVM = QO) so that the accumulator results will not be loaded with the satu- 
ration value. Note that this means that some additional code may be required 
if overflow of the most significant portion of the result is expected. 


The carry bit is set whenever the addition of a value from the input scaling 
shifter or the P register to the accumulator contents generates a carry out of 
bit 31. Otherwise, the carry bit is reset since the carry out of bit 31 is a zero. 
One exception to this case is the ADDH instruction which can only set the 
carry bit. This allows the accumulation to generate the proper single carry 
when either the addition to the lower or upper half of the accumulator actually 
causes the carry. The following examples help to demonstrate the significance 
of the carry bit on the TMS320C25 for additions: 


Cc MSB LSB Cc MSB LSB 
X FRFFF FFFF ACC X FRFFF FFFF ACC 
+ 1 +FREEF FEFEF 
1 0000 0000 1 FRFFF FFFE 
7F FF FFFEF Acc X 7FEF FFEF ACC 
+ 1 +P REF FEEE 
0 8000 0 1 7TFFF FFFE 
xX 8000 0000 acc Xx 8000 0000 acc 
+ 1 +F FEF FFEEF 
0 800 0 1 7FFPF FFFF 
1.0000 0 0 ACC 1 FRFFF FFFF ACC 
+ 0 (ADDC) + Q (ADDC) 
0 0000 0001 1 0000 0000 
1 8000 FFFF Acc 1 8000 FFFF ACC 
+0000 0000 (ADDH) +7F FEF 0000 (ADDH) 
1 8000 FFFF 1 FRFPFF FRFFF 


Example 5-37 shows an implementation of two 64-bit numbers added to each 
other to obtain a 64-bit result. This example, which adds 16-bit parts and 
generates a carry (C) bit in the accumulator, will run on the TMS32020. 


Software Applications - Advanced Arithmetic Operations 


Example 5-37. 64-Bit Addition (TMS32020) 


* TWO 64-BIT NUMBERS ARE ADDED TO EACH OTHER PRODUCING A 
* 64-BIT RESULT. THE NUMBERS X (X3,X2,X1,X0) AND ¥ 
* (Y3,¥2,Y1,YO) ARE ADDED RESULTING IN W (W3,W2,W1,WO). 


X3 X2 X1 xO 
* + ¥3-Y¥2 Y1 YO 
KR a ee ee ee ce ne me ee 
* W3 W2 Wl wo 
* 
ADD64 ZALS XO ; ACC = 00 xO 
ADDS YO ; ACC = 00 XO + 00 YO = Cc wo 
SACL WO 
SACH CARRY 
LAC. CARRY 7 ACC =-00 °C 
ADDS Xl ; ACC = 00 C + OO x1 
ADDS Y1 ; ACC = 00 C + OO X1 + 00 Y1 = C WIL 
SACL W1 
SACH CARRY 
LAC CARRY ; ACC = 00 C 
ADDS x2 ; ACC = 00 C + 00 x2 
ADDS Y2 ; ACC = 00 C + 00 ¥2 + 00 Y2 =C wW2 
SACH CARRY 
LAC CARRY ; ACC = 00 C 
ADDS x3 ; ACC = 00 C + 00 x3 
ADDS Y3 ; ACC = 00 C + 00 X3 + 00 Y3 = C W3 
SACL W3 : | 
RET 


Example 5-38 performs the same addition as Example 5-37 but is specific to 
the TMS320C25. This implementation makes use of the carry (C) status bit, 
adding 32-bit parts. 


Example 5-38. 64-Bit Addition (TMS320C25) 
* TWO 64-BIT NUMBERS ARE ADDED TO EACH OTHER PRODUCING A 


* 64-BIT RESULT. THE NUMBERS X (X3,X2,X1,X0) AND Y 
* (¥3,Y2,Y¥1,Y0) ARE ADDED RESULTING IN W (W3,W2,W1,WO). 
* 


* X3 X2 X1 xO 
* + ¥3 ¥2 ¥1 YO 
Kn oe ea ae ea cae ae ee aw ao 
* W3 W2 W1 wo 
* 
ADD64 ZALH X1 ; ACC = X1 00 
ADDS XO ; ACC = Xl xO 
ADDS YO ; ACC = X1 XO + 00 YO 
ADDH Yl ; ACC = X1 XO + ¥1 YO = W1 WO 
SACL wo 
SACH Wl 
ZALH X3 ; ACC = x3 00 
ADDC X2 ; ACC = X3 X2+C 
ADDS yY2 ; ACC = X3 X2 + 00 Y2+C 
ADDH Y3 ; ACC = X3 X2 + Y3 Y2 + C = W3 W2 
SACL w2 
SACH W3 
RET 
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In a similar way to addition, the carry bit on the TMS320C25 is reset whenever 
the input scaling shifter or the P-register value subtracted from the accu- 
mulator contents generates a borrow into bit 31. Otherwise, the carry bit is 
set since no borrow into bit 31 is required. One exception to this case is the 
SUBH instruction which can only reset the carry bit. This allows the gener- 
ation of the proper single carry when either the subtraction from the lower or 
upper half of the accumulator actually causes the borrow. The following ex- 
amples help to demonstrate the significance of the carry bit for subtractions: 


Cc MSB LSB Cc MSB LSB 
xX 0000 0000 acc Xx 0000 0000 acc 
- 1 -FFFF FFFF 
FFF FFFF 0 0000 0001 
X 7FFF FFFF acc X 7FFEF FFFEF ACC 
- 1 -FFFF FFFEF 
1 7FFF E 8000 0000 
Xx 8000 0000 acc X 8000 0000 acc 
- 1 -FFFF FFFE 
1 7FFF FFFF 0 86000 00017 
0 0000 0000 acc O FFFF FFFF ACC 
- 0 (SUBB) - Q (SUBB) 
o FFFF FFFFE 1 FRFFPF FFFE 
0 8000 FFFF ACC 0 8000 FFFF ACC 
-0001 0000 (SUBH) -F FFF 0000 (SUBH) 
O 7FFF FFFF 0 8007 FFFF 


Example 5-39 implements the subtraction of two 64-bit numbers on the 
TMS32020. A borrow (B) is generated within the accumulator for each of the 
16-bit parts of the subtraction operation. 
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Example 5-39. 64-Bit Subtraction (TMS32020) 


* TWO 64-BIT NUMBERS ARE SUBTRACTED, PRODUCING A 64-BIT 
* RESULT. THE NUMBER Y (Y¥3,¥2,¥1,Y0) IS SUBTRACTED FROM 
* X (X3,X2,X1,X0) RESULTING IN W (W3,W2,W1,WO). 


* 
* X3 X2 X1 xO 
* - Y¥3 ¥2 Y1 YO 
Ket or ee oa eee eee ee ee wae 
* W3 W2 W1 WO 
* 
SUB64 ZALS XO ; ACC = 00 xO 
SUBS YO ; ACC = 00 XO - 00 YO = B WO 
SACL WO 
SACH BORROW 
LAC BORROW ; ACC = B 
ADDS Xl ; ACC = B + 00 X1 
SUBS Y1 ; ACC = B + 00 X1 - 00 Y1 = B Wil 
SACL W1 


LAC BORROW ; ACC = B 

ADDS X2 ; ACC = B + OO X2 

SUBS Y2 ; ACC = B + 00 X2 - OO Y2 = B W2 
SACL W2 

SACH BORROW 

LAC BORROW ; ACC = B 

ADDS X3 ; ACC = B + 00 X3 

SUBS Y3 ; ACC = B + 00 X3 - OO Y¥3 = B W3 
SACL W3 


The advantage of using the carry (C) status bit on the TMS320C25 in imple- 
menting the same subtraction as Example 5-39 is shown in the coding of Ex- 
ample 5-40. 


Example 5-40. 64-Bit Subtraction (TMS320C25) 
* TWO 64-BIT NUMBERS ARE SUBTRACTED, PRODUCING A 64-BIT 


* RESULT. THE NUMBER Y (Y¥3,Y¥2,Y1,YO) IS SUBTRACTED FROM 
* X (X3,X2,X1,X0) RESULTING IN W (W3,W2,W1,WO). 


* 
* X32 XD XL. KO 
* - Y3 ¥2 Y1 YO 
ed 
* W3 W2 W1 wo 
* 
SUB64 ZALH X1 7. ACC = X1 00 
ADDS XO ; ACC-=.x1. XO 
SUBS YO ; ACC = X1 XO - 00 YO 
SUBH Y1 ; ACC = Xl XO - Y1 YO = W1 WO 
SACL wo 
SACH W1 
ZALS X2 ; ACC = 00 X2 
SUBB Y2 ; ACC = 00 X2 - 00 Y2 -C 
ADDH X3 : BCC X80 82] O00 Y2-— 2 
SUBH Y3 ; ACC = X3 X2 - Y3 Y2 - C = W3 W2 
SACL W2 
SACH W3 
RET 
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The second feature of the TMS320C25 aiding in extended-precision calcu- 
lations is the MPYU (unsigned multiply) instruction. The MPYU instruction 
allows two unsigned 16-bit numbers to be multiplied and the 32-bit result 
placed in the product register in a single cycle. Efficiency is gained by the 
ability to generate partial products from the 16-bit portions of a 32-bit or 
larger value instead of having to split the value into 15-bit or smaller parts. 


Example 5-41 and Example 5-42 show implementations of multiplying two 
32-bit numbers to obtain a 64-bit result. The coding of Example 5-41 will 
perform the 32-bit multiply on a TMS32020. The advantage in using the 
MPYU instruction can be observed in Example 5-42, which will execute on 
the TMS320C25. 
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Example 5-41. 32 x 32-Bit Multiplication (TMS32020) 


TWO 32-BIT NUMBERS ARE MULTIPLIED, PRODUCING A 64-BIT RESULT. 
THE NUMBERS X (X1,X0) AND Y (Y¥1,Y0O) ARE MULTIPLIED RESULTING 
IN W (W3,W2,W1,W0O). 


W3 W2 W1 WO 


* 

* 

* 

* 

* 

* 

* 

* 

k 

* 

* 

* 

* 

* 

* THE PROCEDURE FOR MULTIPLICATION IS TO SEPARATE THE 32-BIT 

* MAGNITUDE VALUES OF X AND Y INTO THREE PARTS OF 2, 15, AND 15 
* BITS EACH. THE MULTIPLICATION BY PARTS THEN PRODUCES A 5-PART 
* RESULT OF 3, 15, 15, 15, AND 15 BITS, WHICH ARE RECOMBINED 

* INTO FOUR DATA WORDS OF 16 BITS EACH. 

* 

* X2 X1 XO 

* x Y2 Y1 YO 

* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 


W4 W3 W2 W1 WO 
DETERMINE THE SIGN OF THE PRODUCT. 


MPY32 ZALS Xl 7 ACCL SXXX XXXX XXXX XXXX 
XOR Y1 ; ACCL 
; 


SACH SIGN,1 SAVE THE PRODUCT SIGN O=+, 1=-. 


* 


* TAKE THE ABSOLUTE VALUE OF BOTH X AND Y AND REPARTITION. 
* 


ABSX  ZALH X1 ; ACC = X1 00 
ADDS XO sc ACCS" x1 XO 
ABS 
SACH X1,1 ; SAVE |X2xX1|. 
AND M7FFF 
SACL XO ; SAVE |XO]. 
ZALS Xl 
SACH X2,1 ; SAVE |X2|. 
AND M7FFF 
SACL Xl ; SAVE |X1]|. 
ABSY  ZALH Y1 ; ACC = Y1 00 
ADDS YO ; ACC = Y1 YO 
ABS 
SACH Y1,1 ; SAVE |Y2y¥1]. 
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AND M7FFF 


SACL YO ; SAVE |YO]. 
ZALS Y1 

SACH Y2,1 ; SAVE |Y¥2]|. 
AND M7FFF 

SACL Yl ; SAVE |Y1]|. 


* 
* MULTIPLY |X| AND |Y| TO PRODUCE |W]. 
* 


MULT LAC X2 ; 'AND' FUNCTION IS A 1-BIT BY 
AND Y2 ; 1-BIT MULTIPLICATION. 
SACL W4 ; SAVE PARTIAL |W4]|. 
LT XO ; T = XO 
MPY YO ; T = XO, P = X0*YO 
PAC wo ; T = XO, P = XO*YO, ACC = XO*YO 
SACH W1,1 ; SAVE PARTIAL |W1|. 
AND M7FFF 
SACL WO ; SAVE |WO]|. 
ZALS W1 ; T = XO, P = XO*YO 

* 3; ACC = XO*YO*2**-16 
MPY Y1 >; T = XO, P = XO*Y1 

* 3; ACC = XO*YO*2**-16 
LTA X1 ; T = Xl, P = xO*Y1 

* 3; ACC = XO*Y1 + XO*YO*2**-16 
MPY YO 7; T = X1, P = X1*YO 

* ; ACC = XO*¥Y1 + XO*YO*2**-16 
LTA xO ; T = XO, P = X1*Y0O 

* 3; ACC = X1*YO + XO*Y1 + XO*YO*2**-16 
SACH W2,1 ; SAVE PARTIAL |W2]. 
AND M7FFF 
SACL Wl ; SAVE |W1l]. 
ZALS W2 ; T = XO, P = X1*YO 

* 3 ACC = (X1*YO+XO*Y1) *2**-16 
MPY Y2 3; T = XO, P = XO*Y2 

* >; ACC = (X1*YO+X0O*Y1) *2**-16 
LTA Xl ; T = Xl, P = xO*Y2 

* 3; ACC = XO*Y2 + (X1L*YO+XO*Y1) *2**-16 
MPY Yl >; T = Xl, P = X1*Y1 

* >; ACC = XO*Y2 + (X1*YO+XO*Y1) *2**-16 
LTA X2 > T = X2, P = X1¥Y1 

* 3; ACC = X1*Y1 + XO*Y2 + (X1*YO+XO*Y1) *2**-16 
MPY YO 3; T = X2, P = X2*YO 

bs ; ACC = X1*Y1 + XO*Y2 + (X1*YO+XO*Y1L) *2**-16 
LTA X1 3; T = X1, P = X2*YO 

x 3; ACC = X2*YO + XL*Y1 +X0O*Y2 

* 3 + (X1*YO+X0O*Y1) *2**-16 
SACH W3,1 ; SAVE PARTIAL |W3{. 
AND M7FFF 
SACL W2 ; SAVE |W2]. 
ZALS W3 ; T = Xl, P = X2*YO 

* >; ACC = (X2*YO+X1*Y1+X0*Y2) *2**-16 
MPY Y2 >; T = Xl, P = X1*Y2 

* 3 ACC = (X2*YO+X1*Y1+XO0*Y2) *2**-16 
LTA X2 7 T = X2, P = X1*Y2 

= 3; ACC = X1*Y2 + (X2*YO+X1*Y1+XO*Y2) *2**-16 
MPY Yl > T = X2, P = X2*Y1 

* 7 ACC = X1*Y2 + (X2*YO+X1*Y1+X0*Y2) *2**-16 
APAC ; ACC = X2*Y1 + X1*yY2 

* ; + (X2*YO+tX1*Y14+X0* Y2) *2**-16 
ADD W4,15 7 ACC = X2*Y2*2**15 + X2*Y1 + X1*Y2 


CK 2RV OLR IY LE ROAY2 A 24816 
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SACH W4,1 ; SAVE |W4]. 
AND M7FFF 
SACL W3 ; SAVE |W3]. 


* RECOMBINE W AND GENERATE TWO'S-COMPLEMENT RESULT. 


* 


ZAC 
SUB SIGN 
SACL SIGN ; SIGN O=+, -l=-. 
* 
LAC wW1,15  ; ACC = jW1 OO] 
ADD wo ; ACC = |W1 WO] 
ADD SIGN 
XOR SIGN ; COMPLEMENT WO WHEN SIGN = -1. 
SACL wo ; SAVE WO. 
SACZ W1 ; SAVE PARTIAL |Wl]. 
LAC W2,14 ; ACC = jW2 00} 
ADD Wl ; ACC = |W2 Wl] 
XOR SIGN ; COMPLEMENT W1 WHEN SIGN = -1. 
SACL W1 ; SAVE Wl. 
SACH W2 ; SAVE PARTIAL |W2]. 
LAC W3,13  ; ACC = |W3 00] 
ADD wW2 ; ACC = |W3 W2] 
XOR SIGN ; COMPLEMENT W2 WHEN SIGN = -1. 
SACL W2 ; SAVE W2. 
SACH W3 ; SAVE PARTIAL |W3]. 
‘LAC W4,12  ; ACC = |W4 00] 
ADD W3 ; ACC = |W4 W3] 
XOR SIGN ; COMPLEMENT W3 WHEN SIGN = -1. 
SACL W3 ; SAVE W3. 
RET 
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Example 5-42. 32 x 32-Bit Multiplication (TMS320C25) 


* TWO 32-BIT NUMBERS ARE MULTIPLIED, PRODUCING A 64-BIT 
* RESULT. THE NUMBERS X (X1,X0) AND Y (Y1,Y0O) ARE 
* MULTIPLIED RESULTING IN W (W3,W2,W1,W0O). 
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* 


W3 W2 W1 WO 


DETERMINE THE SIGN OF THE PRODUCT. 


+ OF OO 
bas) 
aed 
* 
K 
ke 


MPY32 ZALS X1 
XOR Y1 
SACH SIGN,1 


* 


* TAKE THE ABSOLUTE 

* 

ABSX ZALH X1 
ADDS XO 
ABS 
SACH X1 
SACL XO 

ABSY ZALH Y1 
ADDS YO 
ABS 
SACH Y1 
SACL YO 


* 


* MULTIPLY |X| AND 

* 

MULT LT xO 
MPYU YO 
SPL Wl 
SPH wo 
MPYU Y1 
LTP X21 
MPYU YO 
ADDS Wl 


MPYA Y1 


SACL W1 
SACH W2 
ZALS W2 


BNC SUM 
ADDH ONE 

SUM APAC 
SACL Ww2 
SACH W3 


e 
’ 
. 
‘ 
. 
‘ 


ACCL = SXXX XXXX XXXX XXXX 
ACCL = S--- ---- ---- oe 
SAVE THE PRODUCT SIGN O=+, 1=-. 


VALUE OF BOTH X AND Y. 


we Ne 


we Ne Ne Ne 


we Ne 


ACC = X1 00 
AcC = X1 x0 
SAVE |X1|. 
SAVE |XO|. 
ACC = Y1 00 
ACC = Y1 x0 
SAVE |Y1|. 
SAVE |YO|. 


|¥| TO PRODUCE |W]. 


me Ne Ne Ne Ne Ne We Ne Ne Ne Ne Ne Ne Ne Ne Ne 


me Ne Ne 


T = XO 

T = XO, P = XO*YO 

SAVE |WO]|. 

SAVE PARTIAL |W1]|. 

T = XO, P = XO*Y1 

T = X1, P = XO*Y¥1, ACC = xO*Y1 
T = Xl, P = X1*YO, ACC = XO*Y1 
T = Xl, P = X1*YO, 


Acc = XO*Y1l + XO*YO*2**-16 

,T = X1, P = X1*Yi, : 

ACC = X1*YO + XO*Y1 + XO*YO*2**-16 
SAVE |W1]. 

SAVE PARTIAL |W2]. 

P = X1*Y1, 

ACC = (X1*YO + xXO*y1)*2**-16 

TEST FOR CARRY FROM W2. 


ACC = X1*Y1 + (X1*YO + XO*Y1)*2**-16 
SAVE |W2|. 
SAVE |W3]. 
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* TEST THE SIGN OF THE PRODUCT; NEGATE IF NEGATIVE. 
x 


DONE 


LAC 
BZ 


ZALH 
ADDS 
CMPL 
ADD 

SACL 
SACH 
ZALS 
ADDH 
CMPL 
ADDC 
SACL 
SACH 
RET 


SIGN 
DONE 


W1 
wo 


ONE 
wo 
W1 
W2 
W3 


ONE 
W2 
W3 


f 


er Oe ee TT 


we we Ne 


RETURN IF POSITIVE. 


ACC = |W1 00| 

ACC = |W1 WO] 

ACC = Wl WO AND CARRY GENERATION 
SAVE WO. 

SAVE Wl. 

ACC = |00 w2] 

ACC = |W3 W2| 

ACC = W3 W2 

SAVE W2. 

SAVE W3. 
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5.7 Application-Oriented Operations 


The TMS320C2x has been designed to provide efficient implementations of 
many common digital signal processing algorithms. The architecture sup- 
porting these design features was discussed in Section 3. In general, the fea- 
tures provide efficient solutions to numerically intensive problems usually 
characterized by multiply/accumulates. Some device-specific features that aid 
in the implementation of specific algorithms include companding, filtering, 
Fast Fourier Transforms (FFT), and PID contro!. These applications require |/O 
performed either in parallel or serial. Hardware requirements for 1/O are dis- 
cussed in Sections 3 and 6. 


5.7.1 Companding 
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In the area of telecommunications, one of the primary concerns is the I/O 
bandwidth in the communications channel. One way to minimize this band- 
width is by companding (COMpress/exPAND). Companding is defined by 
two. international standards, A-law and y-law, both based on the compression 
of the equivalent of 13 bits of dynamic range into an 8-bit code. The standard 
employed in the United States and Japan is p-law companding. The European 
standard is referred to as A-law companding. Detailed descriptions and code 
examples of p-law and A-law companding are presented in an application re- 
port on companding routines included in the book, Digita/ Signal Processing 
Applications with the TMS320 Family. 


The technique of companding allows the digital sample information corre- 
sponding to a 13-bit dynamic range to be transmitted as 8-bit data. For pro- 
cessing in the TMS320C2x, it is necessary to convert the 8-bit (logarithmic) 
sign-magnitude data to a 16-bit two’s-complement (linear) format. Prior to 
output, the linear result must be converted to the compressed or companded 
format. Table lookup or conversion subroutines may be used to implement 
these functions. 


Software routines for p-law and A-law companding, flowcharts, companding 
algorithms, and detailed descriptions are provided in the application report on 
companding routines mentioned above. The algorithm space and time re- 
quirements for y-law and A-law companding on the TMS32020/C25 are 
given in Table 5-1. 


Table 5-1. Program Space and Time Requirements for »p-/A-Law 
Companding 


FUNCTION | MEMORY WORDS | PROGRAM CYCLES 
Program Data |Initialization Loopt 

pi-Law: 

Compression 74 8 19 45 

Expansion 276 2 14 5 
A-Law: 

Compression 100 8 

Expansion 276 2 


tAssuming initialization 
Worst case 


TIME (ps) REQDT 
‘20 ’C25 


9 45 
1 0.5 
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In expanding from the 8-bit data to the 13-bit linear representation, table 
lookup is very effective since the table length is only 256 words. This is es- 
pecially true for a microcomputer design since the TMS320C25 has 4K words 
of mask-programmable ROM. The table lookup technique requires three in- 
structions (four words of program memory), one data memory location, 256 
words of table memory, and seven instruction cycles (program in on-chip 
ROM) to execute. 


LAC SAMPLE ; LOAD 8-BIT DATA. 
ADLK MUTABL ; ADD THE CONVERSION TABLE BASE ADDRESS. 
TBLR SAMPLE ; READ THE CORRESPONDING LINEAR VALUE. 


The above conversion could be programmed as a subroutine. This would 
eliminate the need for a table, but would increase execution time and require 
additional data memory locations. 


When the output data has been determined in a system transmitting com- 
panded data, a compression of the data must be performed. The compression 
reduces the data back to the 8-bit format. Unless memory for a table of length 
16384 is acceptable, the table lookup approach must be abandoned for con- 
version routines. Details of these implementations may be found in the appli- 
cation report on companding. 


Access to new companding code as it becomes available is provided via the 
TMS320 DSP Bulletin Board Service. The bulletin board contains TMS320 
source code from application reports included in Digital Signal Processing 
Applications with the TMS320 Family. See the TMS320 Family Development 
Support Reference Guide for information on how to access the bulletin board. 


5.7.2 FIR/IUR Filtering 


Digital filters are a common requirement for digital signal processing systems. 
The filters fall into two basic categories: Finite Impulse Response (FIR) and 
Infinite Impulse Response (IIR) filters. For either category of filter, the coeffi- 
cients of the filter (weighting factors) may be fixed or adapted during the 
course of the signal processing. The theory and implementation of digital fil- 
ters has been presented and discussed in an application report (see the book, 
Digital Signal Processing Applications with the TMS320 Family). The 
TMS320C25 reduces the execution time of all filters by virtue of its 100-ns 
instruction cycle time. 


IIR filters benefit from the 100-ns instruction cycle time of the TMS320C25. 
IIR filters typically require fewer multiply/accumulates. Correspondingly, the 
amount of data memory for samples and coefficients is not usually the limiting 
factor. Because of sensitivity to quantization of the coefficients themselves, 
IIR filters are usually implemented in cascaded second-order sections. This 
translates to instruction code consisting of LTD-MPY instruction pairs rather 
than MACDs. Example 5-43 provides an implementation of a second-order 
IIR filter. 
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Example 5-43. Implementing an IIR Filter 
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* 

* THE FOLLOWING EQUATIONS ARE USED TO IMPLEMENT AN IIR FILTER: 
* 

* d(n) = x(n) + d(n-1)al + d(n-2)a2 

* y(n) = d(n)bO + d(n-1)b1l + d(n-2)b2 

* 


START IN XN,PAO ; INPUT NEW VALUE XN 
LAC XN,15 ; LOAD ACCUMULATOR WITH XN 


* 


LT DNM1 
MPY Al 

* 
LTD DNM2 
MPY A2 

* 
APAC 
SACH DN,1l >; d(n) = x(n) + d(n-1)al + d(n-2)a2 
ZAC 
MPY B2 

* ‘ 
LTD DNM1 
MPY Bl 

* 
LTD DN 
MPY BO 

* 
APAC 


SACH YN,1 ; y(n) = d(n)bO + d(n-1)bl + d(n-2)b2 
OUT YN,PA1L ; YN IS THE OUTPUT OF THE FILTER 


FIR filters also benefit from the faster instruction cycle time. In addition, an 
FIR filter requires many more multiply/accumulates than does the IIR filter 
with equivalent sharpness at the cutoff frequencies and distortion and atten- 
uation in the passbands and stopbands. The TMS320C2x can help solve this 
problem by making longer filters feasible to implement. This is accomplished 
by allowing the coefficients to be fetched from program memory at the same 
time as a sample is being fetched from data memory. The simple implementa- 
tion of this process uses the MACD instruction with the RPT/RPTK instruc- 
tion. 


RPTK 255 
MACD COEFFP, *- 


The coefficients on the TMS32020 may be stored anywhere in on-chip RAM. 
Filters of up to 256 taps can be implemented at an execution speed of 200 
ns per tap. 


The coefficients on the TMS320C25 may be stored anywhere in program 


memory (reconfigurable on-chip RAM, on-chip ROM, or external memories). 
When the coefficients are stored in on-chip ROM or externally, the entire on- 
chip data RAM may be used to store the sample sequence. Ultimately, this 
allows filters of up to 512 taps to be implemented on the TMS320C25. The 
filter executes at full speed or 100 ns per tap as long as the memory supports 
full-speed execution. 
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5.7.3 Adaptive Filtering 


With FIR/IIR filtering, the filter coefficients may be fixed or adapted. If the 
coefficients are adapted or updated with time, then another factor impacts the 
computational capacity. This factor is the requirement to adapt each of the 
coefficients, usually with each sample. The MPYA or MPYS and ZALR in- 
structions on the TMS320C25 aid with this adaptation to reduce the exe- 
cution time. 


A means of adapting the coefficients on the TMS320C2x is the Least- Mean- 
Square (LMS) algorithm given by the following equation: 


byll+t) = by () + 2B eli) x(i-k) 
where afi) = x(i) - y(i) 
N-1 
and yli) =)" by x(i-K) 
k=0 


Quantization errors in the updated coefficients can be minimized if the result 
is obtained by rounding rather than truncating. For each coefficient in the filter 
at a.given point in time, the factor 2*B*e(i) is a constant. This factor can then 
be computed once and stored in the T register for each of the updates. Thus, 
the computational requirement has become one multiply/accumulate plus 
rounding. Without the new instructions, the adaptation of each coefficient is 
five instructions corresponding to five clock cycles. This is shown in the fol- 
lowing instruction sequence: 


LRLK AR2,COEFFD ; LOAD ADDRESS OF COEFFICIENTS. 
LRLK AR3,LASTAP ; LOAD ADDRESS OF DATA SAMPLES. 


LARP AR2 ; 

LT ERRF 3; errf = 2*B*e(i) 

ZALH *,AR3 ; ACC = bk(i)*2**16 

ADD ONE,15 ; ACC = bk(i)*2**16 + 2**15 

MPY *-,AR2 

APAC >; ACC = bk(i)*2**16 + errf*x(i-k) + 2**15 


SACH *+ ; SAVE bk(itl). 


When the MPYA and ZALR instructions on the TMS320C25 are used, the 
adaptation reduces to three instructions corresponding to three clock cycles, 
as shown in the following instruction sequence. Note that the processing or- 
der has been slightly changed to incorporate the use of the MPYA instruction. 
This is due to the fact that the accumulation performed by the MPYA is the 
accumulation of the previous product. . 
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LRLK 
LRLK 
LARP 
LT 


ZALR 
MPYA 


SACH 


AR2 ,COEFFD 
AR3,LASTAP 
AR2 

ERRF 


* ,AR3 
*=—,AR2 


* + 


. 
? 


. 
, 


et 


LOAD ADDRESS OF COEFFICIENTS. 
LOAD ADDRESS OF DATA SAMPLES. 


errf = 2*B*e(i) 


ACC = bk(i)*2**16 + 2**15 

ACC = bk(i)*2**16 + errf*x(i-k) + 2**15 
PREG = errf*x(i-k+1) 

SAVE bk(it+l). 


Example 5-44 shows a routine to filter a signal and update the coefficients. 
Example 5-45 and Example 5-46 provide the conclusion to the adaptive FIR 
filter routine for the TMS32020 and TMS320C25, respectively. 


Adaptive filter length is restricted both by execution time and memory. There 
is obviously more processing to be completed per sample due to the adapta- 
tion, and the adaptation itself dictates that the coefficients be stored in the 
reconfigurable block of on-chip RAM. Thus, the practical limit of an adaptive 
filter with no external data memory is 256 taps. 
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Example 5-44. 256-Tap Adaptive FIR Filter 


TITL 'ADAPTIVE FILTER! 
DEF ADPFIR 
DEF X,Y 


THIS 256-TAP ADAPTIVE FIR FILTER USES ON-CHIP MEMORY BLOCK 
BO FOR COEFFICIENTS AND BLOCK Bl FOR DATA SAMPLES. THE 
NEWEST INPUT SHOULD BE IN MEMORY LOCATION X WHEN CALLED. 
THE OUTPUT WILL BE IN MEMORY LOCATION Y WHEN RETURNED. 
ASSUME THAT THE DATA PAGE IS O WHEN THE ROUTINE IS CALLED. 


e+ % FH OH 


* 


COEFFP EQU »>FFOO 
COEFFD EQU »>0200 
* 


BO PROGRAM MEMORY ADDRESS 
BO DATA MEMORY ADDRESS 


=e oN 


NEXT NEWEST DATA SAMPLE 
OLDEST DATA SAMPLE 


FRSTAP EQU >0300 
LASTAP EQU >O3FF 
* 


* FINITE IMPULSE RESPONSE (FIR) FILTER. 
* 


ADPFIR CNFP 


ONE EQU >7A ; CONSTANT ONE (DP=0) 

BETA EQU >7B ; ADAPTATION CONSTANT (DP=0) 

ERR EQU >7C ; SIGNAL ERROR (DP=0) 

ERRF EQU >7D ; ERROR FUNCTION (DP=0) 

Y EQU >7E ; FILTER OUTPUT (DP=0) 

X EQU >7F ; NEWEST DATA SAMPLE (DP=0) 
, 


CONFIGURE BO AS PROGRAM: 


MPYK 0O ; Clear the P register. 

LAC ONE,14 ; Load output rounding bit. 

LARP AR3 . 

LRLK: AR3,LASTAP ; Point to the oldest sample. 
FIR RPTK 255 

MACD COEFFP, *- ; 256-tap FIR filter. 

CNFD ; CONFIGURE BO AS DATA: 

APAC 

SACH Y,1 ; Store the filter output. 

NEG 

ADD X,15 ; Add the newest input. 

SACH ERR,1 7; err(i) = x(i) - y(i) 


* 


LMS ADAPTATION OF FILTER COEFFICIENTS. 


LT ERR 
MPY BETA 
PAC errf(i) = beta * err(i) 


wee 


ADD ONE,14 ROUND THE RESULT. 


SACH ERRF,1 


MAR ¥+ 
LAC X ; INCLUDE NEWEST SAMPLE. 
SACL * 


LRLK AR2,COEFFD 
LRLK AR3,LASTAP 
LT ERRF 

MPY *-,AR2 


POINT TO THE COEFFICIENTS. 
POINT TO THE DATA SAMPLES, 


ue Ne 


P = 2*beta*err(i)*x(i-255) 


“eo 
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Example 5-45. Adaptive Filter Routine Concluded (TMS32020) 
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* 


ADAPT 


ZALH 
ADD 
APAC 
MPY 
SACH 


ZALH 
ADD 
APAC 
MPY 
SACH 


ZALH 
ADD 
APAC 
MPY 
SACH 


ZALH 
ADD 
APAC 
MPY 
SACH 


ZALH 
ADD 

APAC 
SACH 


RET 


* ,AR3 
ONE, 15 


*-,AR2 
+ 


* ,AR3 
ONE, 15 


*-,AR2 
*+ 


* ,AR3 
ONE, 15 


*-,AR2 
e+ 


* ,AR3 
ONE,15 


*-,AR2 
*+ 


* 
ONE,15 


* + 


ee 


we me Me Ne Ne 


se Ne Ne Ne 


we 


LOAD ACCH WITH b255(i). 
ADD ROUNDING BIT. 
b255(it1) = b255(i) + P 

P = 2*beta*err(i)*x(i-254) 
STORE b255(it+1). 


LOAD ACCH WITH b254(i). 
ADD ROUNDING BIT. 
b254(it1) = b254(i) + PB 

P = 2*beta*err(i)*x(i-253) 
STORE b254(it1). 


LOAD ACCH WITH b253(i). 
ADD ROUNDING BIT. 
b253(it1) = b253(i) + P 

P = 2*beta*err(i)*x(i-252) 
STORE b253(i+1). 


LOAD ACCH WITH bl(i). 
ADD ROUNDING BIT. 
b1(it1l) = bl(i) + B 

P = 2*beta*err(i)*x(i-0) 
STORE bl(itl). 


LOAD ACCH WITH bO(i). 
ADD ROUNDING BIT. 
bO(it1) = bO(i) + P 
STORE bO(it+1). 


RETURN TO CALLING ROUTINE. 
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Example 5-46. Adaptive Filter Routine Concluded (TMS320C25) 


* 


ADAPT  ZALR *,AR3 
MPYA *-,AR2 


LOAD ACCH WITH b255(i) & ROUND. 
b255(it1) = b255(i) + P 

P = 2*beta*err(i)*x(i-254) 
STORE b255(i+1). 


* 


me te Ne Ne 


SACH *+ 
* 
LOAD ACCH WITH b254(i) & ROUND. 
b254(it1l) = b254(i) + P 
P = 2*beta*err(i)*x(i-253) 
STORE b254(it1). 


ZALR *,AR3 
MPYA *-,AR2 


se Me Me Ne 


SACH *+ 


LOAD ACCH WITH b253(i) & ROUND. 
b253(it+t1l) = b253(i) + P 

P = 2*beta*err(i)*x(i-252) 
STORE b253(i+1). 


ZALR *,AR3 
MPYA *-,AR2 


SACH *+ 


ee TT) 


ZALR *,AR3 ; LOAD ACCH WITH b1(i) & ROUND. 
MPYA *-,AR2 b1(it+t1) = bl(i) + P 

P = 2*beta*err(i)*x(i-O) 
STORE bl(i+l). 


+ 
we Ne Ne SN 


SACH *+ 
ZALR * ; LOAD ACCH WITH bO(i) & ROUND. 
APAC ; bO(it1) = bO(i) + BP 

SACH *+ ; STORE bO(it+1). 


RET ; RETURN TO CALLING ROUTINE. 


Table 5-2 provides a comparison of data memory, program memory, and CPU 
cycles for a 256-tap adaptive FIR filter implementation using the TMS32020 
and TMS320C25. Note that n = 256 in the table. 


Table 5-2. 256-Tap Adaptive Filtering Memory Space and Time 
Requirements 


DEVICE WORDS IN MEMORY CPU CYCLES 
Data Program 
TMS32020 5 + 2n 29 + 5n 30 + 6n 
TMS320C25 5 + 2n 30 + 3n 33 + 4n 
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5.7.4 Fast Fourier Transforms (FFT) 
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Fourier transforms are an important tool often used in digital signal processing 
systems. The purpose of the transform is to convert information from the time 
domain to the frequency domain. The inverse Fourier transform converts in- 
formation back to the time domain from the frequency domain. Implementa- 
tions of Fourier transforms that are computationally efficient are known as Fast 
Fourier Transforms (FFTs). The theory and implementation of FFTs on the 
TMS32020 has been discussed in an application report in the book, Digital 
Signal Processing Applications with the TMS320 Family. The TMS320C25 
reduces the execution time of all FFTs by virtue of its 100-ns instruction cycle 
time. 


’ In addition to the shorter cycle time, an addressing feature has been added to 


the TMS320C25 which provides execution speed and program memory en- 
hancements for radix-2 FFTs. As demonstrated in Figure 5-6 and Figure 5-7 
the inputs or outputs of an FFT are not in sequential order, i.e., they are 
scrambled. The scrambling of the data addressing is a direct result of the ra- 
dix-2 FFT derivation. Observation of the figures and the relationship of the 
input and output addressing in each case reveal that the address indexing is 
a bit-reversed order, as shown in Table 5-3. As a result, either the data input 
sequence or the data output sequence must be scrambled in association with 
the execution of the FFT. 


STAGE 1 STAGE 2 STAGE 3 


x(0) 
x(4) 


X(0) 
X(1) 


x(2) 
x(6) 


X(2) 
X(3) 


X(4) 
x(5) 


x(1) 
x(5) 


x(3) 
x(7) 


X(6) 
X(7) 


Ae ge ap) a) waned 
LEGEND FOR TWIDDLE FACTOR: Wy= We W,=Wa Wo=Ws Wa= We 


Figure 5-6. An In-Place DIT FFT with in-Order Outputs and 
Bit-Reversed Inputs 
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STAGE 1 STAGE 2 STAGE 3 
x(0) X(0) 
x(1) X(4) 
x(2) X(2) 
x(3) X(6) 
x(4) X(1) 
x(5) X(5) 
x(6) X(3) 
x(7) X(7) 


. = yO -w! =w2 -w3a 
LEGEND FOR TWIDDLE FACTOR: Wo = Wag W, =We Wo = W5 W3* Ww 8 


Figure 5-7. An In-Place DIT FFT with In-Order Inputs but 
Bit-Reversed Outputs 


Table 5-3. Bit-Reversal Algorithm for an 8-Point Radix-2 DIT FFT 


| INDEX | BIT PATTERN | BIT-REVERSED PATTERN |BIT-REVERSED INDEX 
0 


MOOR WN © 


On the TMS32020, the bit reversal is handled by loading the accumulator with 
pairs of points that needed to be swapped and then storing them back in the 
swapped locations. An addressing feature that uses reverse carry-bit propa- 
gation allows the TMS320C25 to scramble the inputs or outputs while it is 
performing the I/O. The addressing mode is part of the indirect addressing 
implemented with the auxiliary registers and the associated arithmetic unit. In 
this mode (a derivative of indexed addressing), a value (index) contained in 
ARO is either added or subtracted from the auxiliary register being pointed to 
by the ARP. However, instead of propagating the carry bit in the forward di- 
rection, it is propagated in the reverse direction. The result is a scrambling in 
the address access. 


The procedure for generating the bit-reversal address sequence Is to load ARO 
with a value corresponding to one-half the length of the FFT and to load an- 
other auxiliary register, e.g., AR1, with the base address of the data array. Im- 
plementations of FFTs involve complex arithmetic; as a result, there are two 
data memory locations (one real and one imaginary) associated with every 
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data sample. Generally, the samples are stored in memory in pairs with the real 
part in the even address locations and the imaginary part in the odd address 
location. This means that the offset from the base address for any given sample 
is twice the sample index. Real input data is easily transferred into the data 
memory and stored in the scrambled order, with every other location in the 
data memory representing the imaginary part of the data. 


The following list shows the contents of auxiliary register AR1 when ARO is 
initialized with a value of 8 (8-point FFT) and when data is being transferred 
by the code that follows. 


MSB LSB 
ARO: 0000 0000 0000 100 0 8-Point FFT 
AR1: 0000 0010 0000 0 000 _ Base Address 


RPTK 7 
IN *BRO+,PAO 


AR1: 0000 0010 0000 0000 XR(0) 
AR1: 0000 0010 0000 1000 XR(4) 
AR1: 0000 0010 0000 0100 XR(2) 
AR1: 0000 OOO .O-0 0:0 1100 XR(6) 
AR1: 0000 0010 0000 0010 XR(1) 
AR1: 0000 0010 0000 1010 XR(5) 
AR1: 0000 0010 0000 0110 XR(3) 


AR1: 0000 0010 0000 1110 = XR(7) 


Example 5-47 consists of lists of macros used in the implementation of FFTs. 
The first macro implements the bit reversal in the way necessary for the 
TMS32020 and is not necessary for implementations on the TMS320C25. 
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Example 5-47. FFT Macros 


BITREV SMACRO PR,PI,QOR,QI 
* 


* BIT REVERSAL CODE - SWAP PR AND OR, SWAP PI AND QI. 


* 


ZALH 
ADDS 
SACL 
SACH 
ZALH 
ADDS 
SACL 
SACH 
SEND 


x 


COMBO SMACRO R1,11,R2,12,R3,13,R4,14 


* 


* CALCULATE PARTIAL TERMS FOR R3, R4, 13, AND I4. 


LAC :R3:,14 ACC 
ADD :R4:,14 ACC 
SACH :R3:,1 R3 
SUB :R4:,15 ACC 
SACH :R4:,1 R4 
LAC :13:,14 ACC 
ADD :14:,14 ACC 
SACH :1I3:,1 I3 
SUB :I14:,15 ACC 
SACH :14:,1 r4 

* 

* CALCULATE PARTIAL TERMS FOR 
LAC :R1:,14 ACC 
ADD :R2:,14 ACC 
SACH :R1:,1 R1 
SUB :R2:,15 ACC 
ADD :14:,15 ACC 
SACH :R2: R2 
SUBH :I14: ACC 
DMOV :R4: 14 
SACH :R4: R4 
LAC :11:,14 ACC 
ADD :12:,14 ACC 
SACH :I1:,1 Il 
SUB I2:,15 ACC 
SUB :14:,15 ACC 
SACH :12: I2 
ADDH :14: ACC 
SACH :I4: I4 

* 

* CALCULATE PARTIAL TERMS EOe 
LAC :R1:,15 ACC 
ADD :R3:,15 ACC 
SACH :R1: R1 
SUBH :R3: ACC 
SACH :R3: R3 
LAC :I1:,15 ACC 
ADD :13:,15 Acc 
SACH :I1: Il 
SUBH :1I3: ACC 
SACH :I3: I3 
SEND 


: 


(1/4) (R3) 

(1/4) (R3+R4) 

(1/2) (R3+R4) 

(1/4) (R3+R4)-(1/2) (R4) 
(1/2) (R3-R4) 

(1/4) (I3) 

(1/4) (I13+1I4) 

(1/2) (I13+14) 

(1/4) (13+14)-(1/2) (14) 
(1/2) (13-14) 


Hb ud db tt ud te hk on 


2, R44; 12, “AND “24. 
(1/4) (R1) 
(1/4) (R1+R2) 
eee 
(1/4) (R1+R2)-(1/2) (R2) 
(1/4) [(R1-R2)+(13- tay 
(1/4) [(R1-R2)+(13-14) ] 
(1/4) [(R1-R2)-(13-14) } 
R4 = (1/2) (R3-R4) 
(1/4) [(R1-R2)-(13-14) ] 
(1/4) (I1) 
(1/4) (I1+12) 
(1/2) (I1+12) 
(1/4) (11+I2)-(1/2) (12) 
(1/4) [(I1-12)-(13-14) ] 
(1/4) [(11-12)-(13-14) ] 
(1/4) [(11-12)+(13-14) ] 
(1/4) [(I1-12)+(1I3-14) ] 


Hud udu db bb tt bd bode i ys) 


1, R3, I1, AND 13. 

(1/4) (R1+R2) 

(1/4) [(R1+R2)+(R3+R4) 
(1/4) [(R1+R2)+(R3+R4) 
(1/4) [(R1+R2)-(R3+R4) 
(1/4) [(R1+R2) -(R3+R4) 
(1/4) (I1+12) 

(1/4) [(114+12)+(13+1I4) J 
(1/4) [(11+1I2)+(13+14) } 
(1/4) [(11+12)-(13+14) ] 
(1/4) [(I1+12)-(13+14) ] 


os ee 


nu dt tow nh hm men i ve) 
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* 


ZERO SMACRO PR,PI,QR,QI 

* 

* CALCULATE Re[P+Q] AND Re[P-Q] 
LAC :PR:,15 ACC = (1/2) (PR) 
ADD :QR:,15 Acc = (1/2) (PR+OR) 
SACH :PR: PR = (1/2) (PR+QR) 
SUBH :QR: ACC = (1/2) (PR+QR)-(QR) 
SACH :QR: QR = (1/2) (PR-QR) 

* 

* CALCULATE Im[P+Q] AND Im[P-Q] 
LAC :PI:,15 ACC = (1/2) (PI) 
ADD. #QO1s,;15 ACC = (1/2) (PI+QI) 
SACH :PI: PI = (1/2) (PI+QT) 
SUBH :QI: ACC = (1/2) (PI+QI)-(QT) 
SACH :QI: QI = (1/2) (PI-QI) 
SEND 

* 

PIBY4  S$MACRO PR,PI,QR,QI,W 

* 
LT :W: T REGISTER := W=COS(PI/4)=SIN(PI/4) 
LAC :QI:,14 ACC = (1/4) (QT) 
SUB :QR:,14 Acc = (1/4) (QI-QR) 
SACH :QI:,1 QI = (1/2) (QI-QR) 
ADD :QR:,15 ACC = (1/4) (QI+QR) 
SACH :QR:,1 OR = (1/2) (QI+QR) 
LAC :PR:,14 ACC = (1/4) (PR) 
MPY :QR: P REGISTER := (1/4) (QI+QR) *W- 
APAC ACC = (1/4) [PR+(QI+OR) *W] 
SACH :PR:,1 PR = (1/2) [PR+(QI+OR) *W] 
SPAC ACC = (1/4) (PR) 
SPAC Acc = (1/4) [PR-(QI+QOR) *W] 
SACH :QR:,1 OR = (1/2) [PR-(QI+QR) *W] 
LAC :PI:,14 ACC = (1/4) (PI) 
MPY :QI: P REGISTER := (1/4) (QI-QR) *W 
APAC ACC = (1/4) [PI+(QI-OR) *W] 
SACH :PI:,1 PI := (1/2) [PI+(QI-QOR) *W] 
SPAC ACC := (1/4) (PI) 
SPAC ACC = (1/4) [PI-(QI-QR) *wW] 
SACH :QI:,1 QI = (1/2) [PI-(QI-QR) *W] 
SEND 

* 

PIBY2 $MACRO PR,PI,QR,QI 

* 

* CALCULATE Re[P+jQ] AND Re[P-jQ] 
LAC :PI:,15 ACC = (1/2) (PI) 
SUB :QR:,15 ACC = (1/2) (PI-QR) 
SACH :PI: PI = (1/2) (PI-QR) 
ADDH :QR: ACC = (1/2) (PI-QR)+(QR) 
SACH :QR: QR = (1/2) (PI+QR) 

* 

* CALCULATE Im{P+jQ] AND Im[P-jQ] 
LAC :PR:,15 AcC := (1/2) (PR) 
ADD :QI:,15 ACC = (1/2) (PR+QI) 
SACH :PR: PR = (1/2) (PR+QI) 
SUBH :QI: ACC = (1/2) (PR+QI)-(QI) 
DMOV :QR: OR -> QI 
SACH :QR: QR = (1/2) (PR-QI) 
SEND 
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* 


PI3BY4 SMACRO 

* 
LT 
LAC 
SUB 
SACH 
ADD 
SACH 
LAC 
MPY 
APAC 
SACH 
SPAC 
SPAC 
MPY 
SACH 
LAC 
SPAC 
SACH 
APAC 
APAC 
SACH 
SEND 


PR,PI,OR,QI,W 


:W: T REGISTER := W=COS(PI/4)=SIN(PI/4) 
:Q1:,14 ACC = (1/4) (or) 
:QOR:,14 ACC = (1/4) (QI-QOR) 
201s; 1 QI = ae cn QR) 
:OR:,15 Acc = (1/4) (QI+QR) 
:OR:,1 OR = en (QI+QR) 
:PR:,14 ACC = (1/4) (PR) 
:Q1: P REGISTER := (1/4) (QI-QR) *W 
AGC = (1/4) [PR+(QI-QR) *W] 
:PR:,1 PR = (1/2) [PR+(QI-OR) *W] 
ACG = (1/4) (PR) 
ACC = (1/4) [PR-(QI-QR) *W] 
:QR: P REGISTER := (1/4) (QI+QR) *W 
:OR:,1 OR = (1/2) [PR-(QI-QR) *W] 
:PI:,14 ACC = (1/4) (PI) 
ACC = (1/4) [PI-(QI+QR) *W] 
:PI:,1 PI = (1/2) [PI-(QI+QR) *W] 
ACC = (1/4) (PI) 
ACC := (1/4) [PI+(QI+QR) *W] 
:QI:,1 QI = (1/2) [PI+(QI+OR) *W] 


Example 5-48 shows the bit-reversal addressing capability of the TMS320C25 
for implementing an 8-point DIT FFT. On the TMS320C25 the following in- 
structions input the data and store it in memory in the bit-reversed sequence: 


RPTK 7 


IN *BRO+, PAO 


This code combines the functions of input and bit-reversal addressing which 
were previously implemented separately on the TMS32020. The following 
implementation uses a separate bit-reverse macro: 


RPTK 7 

IN *0+,PA0 

BITREV X1R,X11,X4R,X41 
BITREV X3R,X31I,X6R,X61 
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Example 5-48. An 8-Point DIT FFT 


5-74 


XOR EQU 00 

XOI EQU O01 

X1R EQU 02 

X1I EQU 03 

X2R EQU 04 

X21 EQU 05 

X3R EQU 06 

X31 EQU 07 

X4R EQU 08 

X41 EQU 09 

X5R EQU 10 

X51 EQU 11 

X6R EQU 12 

X61 EQU 13 

X7R EQU 14 

X71 EQU 15 

W EQU 16 

WVALUE EQU >5A82 ; VALUE FOR SIN(45) OR COS(45) 
: 


* INITIALIZE FFT PROCESSING. 
* 


FFT SPM 0 > NO SHIFT OF PR OUTPUT 
SSXM ; SET SIGN-EXTENSION MODE. 
ROVM > RESET OVERFLOW MODE. 
LDPK 4 ; SET DATA PAGE POINTER TO 4. 
LALK WVALUE ; GET TWIDDLE FACTOR VALUE. 
SACL W ; STORE SIN(45) OR COS(45). 


* 


INPUT SAMPLES, STORING IN BIT-REVERSED ORDER. 


LARK ARO,8 ; LOAD LENGTH OF FFT IN ARO. 

LRLK AR1,>200 ; LOAD AR1 WITH DATA PAGE 4 ADDRESS. 
LARP ARI 

RPTK 7 


IN *BRO+,PAO ; ONLY REAL-VALUED INPUT 


1ST & 2ND STAGES COMBINED WITH DIVIDE-BY-4 INTERSTAGE 
SCALING. 


+ ee 


COMBO XOR,X0I,X1R,X11,X2R,X21I,X3R,X3I1, 
COMBO X4R,X41,X5R,X51,X6R,X61,X7R,X71. 


* 3RD STAGE WITH DIVIDE-BY-2 INTERSTAGE SCALING. 


ZERO XOR,XOI,X4R,X41 
PIBY4 xX1R,X11I,X5R,X51,W 
PIBY2 X2R,X21,X6R,X6I 
PI3BY4 X3R,X31,X7R,X71,W 


* 


* OUTPUT SAMPLES, SUPPLYING IN SEQUENTIAL ORDER. 
LRLK AR1,>200 ; LOAD AR1 WITH DATA PAGE 4 ADDRESS. 


RPTK 15 
OUT *+,PAO0 + COMPLEX-VALUED OUTPUT 
RET 
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Table 5-4 provides a comparison of execution speed, program memory, and 
data memory for an 8-point DIT FFT implementation using the TMS32020 
and TMS320C25. 


Table 5-4. FFT Memory Space and Time Requirements 


DEVICE WORDS IN MEMORY CPU CYCLES TIME 
Data Program (us) 

TMS32020 17 169 216 43.2 
TMS320C25 17 153 178 17.8 


5.7.5 PID Control 


Control systems are concerned with regulating a process and achieving a de- 
sired behaviour or output from the process. A control system consists of three 
main Components: sensors, actuators, and a controller. Sensors measure the 
behavior of the system. Actuators supply the driving force to ensure the de- 
sired behaviour. The controller generates actuator commands corresponding 
to the error conditions observed by the sensors and the control algorithms 
programmed in the controller. The controller typically consists of an analog 
or digital processor. 


Analog control systems are usually based on fixed components and are not 
programmable. They are also limited to using single-purpose characteristics 
of the error signal, such as P (proportional), | (integral), and D (derivative), 
or their combination. These limitations, along with other disadvantages of 
analog systems such as component aging and temperature drift, are causing 
digital control systems to increasingly replace analog systems in most control 
applications. 


Digital control systems that use a microprocessor/microcontroller are able to 
implement more sophisticated algorithms of modern control theory, such as 
state models, deadbeat control, state estimation, optimal control, and adaptive 
contro}. Digital control algorithms deal with the processing of digital signals 
and are similar to DSP algorithms. The TMS320C2x instruction set can 
therefore be used very effectively in digital control systems. 


The most commonly used algorithm in both analog and digital control systems 
is the PID (Proportional, Integral, and Derivative) algorithm. The classical PID 
algorithm is given by 


u(t) = Kp e(t).+ Kj J edt + Kg de/dt 
The PID algorithm must be converted into a digital form for implementation 
on a microprocessor. Using a rectangular approximation for the integral, the 
PiD algorithm can be approximated as 

u(n) = u(n-1) + Ky e(n) + Ko e(n-1) + K 3 e(n-2) 


This algorithm is implemented in Example 5-49. 
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Example 5-49. PID Control 
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TITL ‘PID CONTROL’ 
DEF PID 
* 


* THIS ROUTINE IMPLEMENTS A PID ALGORITHM. 
* 


UN EQU O ; OUTPUT OF CONTROLLER 
EO EQU 1 ; LATEST ERROR SAMPLE 
El EQU 2 7 PREVIOUS ERROR SAMPLE 
E2 EQU 3 ; OLDEST ERROR SAMPLE 
Kl EQU 4 ; GAIN CONSTANT 

K2 EQU 5 ; GAIN CONSTANT 

K3 EQU 6 ; GAIN CONSTANT 


* 


* ASSUME DATA PAGE O IS SELECTED. 


* 


PID IN EQ,PAO ; READ NEW ERROR SAMPLE 

LAC UN 3; ACC = u(n-1) 

LT E2 ; LOAD T REG WITH OLDEST SAMPLE 

MPY K2 >; P = K2*e(n-2) 

LTD El 3; ACC = u(n-1)+K2*e(n-2) 

MPY Kl ; P = Kl*e(n-1) 

LTD EO ; ACC = u(n-1)+Ki*e(n-1)+K2*e(n-2) 

_MPY KO ; P = KO*e(n) 

APAC ; ACC = u(n-1)+KO*e(n)+K1*e(n-1) 
* ; +K2*e(n-2) 

SACH UN,1 ; STORE OUTPUT 

OUT UN,PA1l ; SEND IT 


The PID loop takes 13 cycles to execute (2.6 us at a 20-MHz clock rate or 1.3 
us at a 40-MHz clock rate). The TMS320 can also be used to implement more 
sophisticated algorithms such as state modeling, adaptive control, state esti- 
mation, Kalman filtering, and optimal control. Other functions that can be 
implemented are noise filtering, stability analysis, and additional control loops. 


6. Hardware Applications 


The TMS320C2x has the power and flexibility to satisfy a wide range of sys- 
tem requirements. The 128K address space for program and data memory can 
be used to interface external memories or to implement single-chip solutions. 
Peripheral devices can be interfaced to the TMS320C2x to perform analog 
signal acquisition at different levels of signa! quality. 


Information and examples on how to interface the TMS320C2x to external 
devices are presented in this section. The examples given are general enough 
to be easily adapted for a particular system requirement. For more detailed in- 
formation, refer to the application reports, “Hardware Interfacing to the 
TMS32020” and “TMS32020 and MC68000 Interface,” included in the book, 
Digital Signal Processing Applications with the TMS320 Family, Volume |. 
Refer also to the application report, Hardware Interfacing to the TMS320C25, 
published separately. Appendix F provides listings and brief information re- 
garding TI memories, peripherals, and analog conversion devices that are used 
in many of the applications in this section. 


The following buses, port, and control signals provide system interface to the 
TMS320C2x processor: 


16-bit address bus (A15-A0) 

16-bit data bus (D15-D0) 

Serial port 

PS, DS, IS (program, data, |/O space select) 
R/W (read/write) and STRB (strobe) 

READY and MSC (microstate complete) 

HOLD and HOLDA (hold acknowledge) 
INT(2-0) and SACK (interrupt acknowledge) 
BIO (branch control) and XF (external flag) 
SYNC (synchronization) and BR (bus request) 


Major hardware applications discussed in this section are listed below and on 
the next page.. 


e System Control Circuitry (Section 6.1 on page 6-3) 
= Powerup reset circuit 
= Crystal oscillator circuit 
ae User target design considerations when using the XDS 


@ Interfacing Memories (Section 6.2 on page 6-10) 
= Interfacing PROMs 
- Wait-state generator 
- Interfacing EPROMs 

Interfacing static RAMs 

interface timing analysis 


e Direct Memory Access (Section 6.3 on page 6-29) 
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Global Memory (Section 6.4 on page 6-32) 


Interfacing Peripherals (Section 6.5 on page 6-34) 


Combo-codec interface 
AIC interface 

D/A interface 

A/D interface 

1/O ports 


System Applications (Section 6.6 on page 6-45) 


Echo cancellation 

High-speed modem 

Voice coding 

Graphics and image processing 


High-speed control 


Instrumentation and numeric processing 
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6.1 System Control Circuitry 


The system control circuitry performs functions that are critical for proper 
system initialization and operation. A powerup reset circuit design and a 
crystal oscillator circuit design are presented in this section. The powerup re- 
set circuit assures that a reset of the part occurs only after the oscillator is 
running and stabilized. The oscillator circuit described allows the use of 
third-overtone crystals that are more readily available at frequencies above 20 
MHz. For a more detailed discussion of system control circuitry, refer to the 
application report, Hardware Interfacing to the TMS320C25. 


6.1.1 Powerup Reset Circuit 


The reset circuit shown in Figure 6-1 performs a powerup reset; i.e., the 
TMS320C2x is reset when power is applied. Note that the switch circuit must 
include debounce circuitry. Driving the RS signal low initializes the processor. 
Reset affects several registers and status bits (see Section 3.6.2 for a detailed 
description of the effect of reset on processor status). 


TMS320C25 


+5V 


Ry =1 Mo 


Cy =0.47 pF 


ij 


DGND 


Figure 6-1. Powerup Reset Circuit 
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For proper system initialization, the reset signal must be applied for at least 
three CLKOUT cycles, i.e., 300 ns for a TMS320C25 operating at 40 MHz. 
Upon powerup, it can take several to hundreds of milliseconds before the 
system oscillator reaches a stable operating state. Therefore, the powerup reset 
circuit should generate a low pulse on the reset line until the oscillator is S stable 
(i.e., 100 to 200 ms). 


The voltage on the reset pin (RS) is controlled by the R1C; network (see 
Figure 6-1). After a reset, this voltage rises exponentially according to the time 
constant R;Cj, as shown in Figure 6-2. 


VOLTAGE 


we = Vcc (1-e7t/7) 


Vee OS ae oe 


to =0 ty TIME 


Figure 6-2. Voltage on TMS320C25 Reset Pin 


The duration of the low pulse on the reset pin is approximately t;, which is the 
time it takes for the capacitor C; to be charged to 1.5 V. This is approximately 
the voltage at which the reset input switches from a logic level 0 to a logic 
level 1. The capacitor voltage is given by 


t 
V = Vcc [ S67 (1) 
where t = RjC; is the reset circuit time constant. Solving (1) for t gives 
Vv 
t= -RiC mn ft - ye | 2 
Vcc © 
For example, setting the following: 
Ry =1 MOQ Vec =5V 
C, = 0.47 uF V=V,=1.5V 


gives t = t; = 167 ms. In this case, the reset circuit of Figure 6-1 can generate 
a low pulse of long enough duration (167 ms) to ensure the stabilization of 
the oscillator upon powerup in most systems. 
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6.1.2 Crystal Oscillator Circuit 


The crystal oscillator circuit shown in Figure 6-3 is designed to operate at 
40.96 MHz. Since crystals with fundamental oscillation frequencies of 30 MHz 
and above are not readily available, a parallel-resonant third-overtone oscilla- 
tor is used. If a packed clock oscillator is used, oscillator design is of no 
concern. A third-overtone 40.96-MHz crystal is shown in Figure 6-3. 


TMS320C25 


40.96 MHz 


74ACO04 


C = 20 pF 
0.1 pF 


L = 1.8 pH 


- 


= = DIGITAL GROUND 


= 74AS04 


Figure 6-3. Crystal Oscillator Circuit 


The 74AS04 inverter in Figure 6-3 provides the 180-degree phase shift that a 
parallel oscillator requires. The 4.7-kQ resistor provides the negative feedback 
that keeps the oscillator in a stable state; i.e., the poles of the system are con- 
strained in a narrow region about the jw axis of the s-plane (analog domain). 
The 10-kQ potentiometer is used to bias the 74AS04 in the linear region. 


In a third-overtone oscillator, the crystal fundamental frequency must be at- 
tenuated so that oscillation is at the third harmonic. This is achieved with an 
LC circuit that filters out the fundamental, thus allowing oscillation at the third 
harmonic. 


The impedance of the LC network must be inductive at the crystal fundamental 
frequency and capacitive at the third harmonic. The impedance of the LC cir- 
cuit is given by 


L 
ee ee 
Z(w) = ; (3) 
j | ol - — 
l cl 
Therefore, the LC circuit has a pole at 
Pei Comme 
Wp = ; 
Pp JIC (4) 
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At frequencies significantly lower than wp, the 1/(wC) term in (3) becomes 
the dominating term, while wL can be neglected. This gives 


Z(w) = jJwL for w << wp (5) 


In (5), the LC circuit appears inductive at frequencies lower than wp. On the 
other hand, at frequencies much higher than wp, the wl term is the dominant 
term in (3), and 1/(wC) can be neglected. This gives 


Z(w) = joc for w >> Wp (6) 
The LC circuit in (6) appears increasingly capacitive as frequency increases 
above wp. This is shown in Figure 6-4, which is a plot of the magnitude of the 
impedance of the LC circuit of Figure 6-3 versus frequency. 


Based on the discussion above, the design of the LC circuit proceeds as fol- 
lows: Choose the pole frequency wp approximately halfway between the 
crystal fundamental and the third harmonic. The circuit now appears inductive 
at the fundamental frequency and capacitive at the third harmonic. 


In the oscillator of Figure 6-3, wo = 26.5 MHz, which is approximately half- 
way between the fundamental and the third harmonic; and C = 20 pF. Then, 
using (4), L = 1.8 pH. 


Jz (w)| 


INDUCTIVE 
REGION 


CAPACITIVE 
REGION 


1 : w 


Vic (rad/s) 


Figure 6-4. Magnitude of Impedance of Oscillator LC Network 
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6.1.3 User Target Design Considerations When Using the XDS 


The architecture for the TMS320C2x emulator (XDS) maximizes speed and 
performance. No external serial logic levels have been added to any of the 
address, data, or control signals other than those added to the setup times of 
READY, RS, BIO, and HOLD, and the propagation delay of HOLDA (hold ac- 
knowledge). The additional loading on outputs induced by the XDS is com- 
prehended in the XDS and TMS320C2x device design, thus allowing the user 
the full drive as specified in the TMS320C2x device data sheet. The DC load- 
ing characteristics of inputs is defined in Chapter 9 of the 7WS320C2x XDS 
User's Guide. 


The emulator architecture works closely with the user’s system design to allow 
the user’s memory to have maximum access times. Areas of close interaction 
between the emulator and target system are: 


@ Bus control 

@ READY timing and memory substitution 
@ Reset and hold 

e Miscellaneous considerations. 


Bus Control 


When the emulator is halted from the keyboard or any of the breakpoint 
functions, the current state of the device being emulated is extracted by the 
control processor. This processor communicates with the emulated device 
over the emulated device’s data bus. Additional communication is generated 
by commands entered from the keyboard. 


Before communication between the control processor and the device being 
emulated begins, the control processor generates an interlock sequence on the 
emulated device’s HOLD input in order to define data bus ownership. Once the 
_target HOLD is deactivated, this interlock prevents the target system from re- 
ceiving an active HOLDA until the emulator has completed accessing the pro- 
cessor resources. The emulator will not attempt to use the data bus until the 
interlock is successful, thus guaranteeing that it will not try to use the data bus 
when HOLDA is asserted to the target system. 


When communication between the control processor and the device being 
emulated is complete, the hold interlock is released, and the target system can 
again receive hold acknowledge when HOLD is asserted. At this point, the 
emulator is waiting for another command from the keyboard. Communication 
between the device being emulated and the contro! process occurs when DS, 
PS, IS, and HOLDA are all high. 


The target system should drive the data bus only when the following condi- 
tions are met: 


) HOLDA is active, or 
@ DS, PS, or IS is active and R/W is high. 


The XDS hardware uses the data bus only while the above signals are inactive. 
When these rules are not followed, the XDS gives a ‘PROCESSOR SYNC 
LOST’ 1160 error. This error may also be caused by signal-to-signal shorts in 
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the target system, misalignment of the target connector, poor grounding of the 
target connector, or wiring errors on the target system. 


READY and Memory Substitution 


Since the XDS adds one internal level of 7 ns in series with the READY input, 
the user’s system is left with only 10 ns to generate READY. This can be ac- 
complished by generating READY with a 10-ns TIBPAL16R4 device. READY 
should be generated off DS, PS, or IS and the decode of the address lines. 


The target system must present a valid READY high on each external access, 
even when using the XDS substitution memory. Suggested implementation 
of READY logic on the target system should hold READY high until target 
memory requiring wait states is addressed. 


The XDS provides two types of memory substitution: fast static RAM at a fixed 
address and slower dynamic RAM at mappable addresses. The user is re- 
sponsible for deselecting target memory residing in the same address of the 
emulator’s fast static memory if this emulator memory is mapped in. (Note that 
the target should not drive the data bus on a read.) This fast static emulator 
substitution memory consists of 8K words of fast static RAM, which can be 
individually mapped in as 4K words of program memory starting at address 
0000 and 4K words of data memory starting at location OOOO. In this case, the 
target system cannot drive the data bus even though DS or PS is active. Al- 
though this emulator static RAM can operate with zero wait states, target wait 
states can be modeled by the user using the target READY signal. However, 
this sensitivity requires that the target system eventually respond with a valid 
READY high or the emulator waits until it does. 


The slower dynamic RAM controls bus access through the DS or PS control 
signals. The target system can drive the data bus when PS or IS is asserted. 
Emulator logic assures that DS, PS, and IS are returned to their inactive state 
when the dynamic RAM substitution memory uses the data bus on reads. 


The dynamic RAM substitution memory always uses more than one clock to 
return data. An access to address space mapped to the dynamic substitution 
memory is accompanied by the assertion of DS or PS, and STRB. When the 
target logic generates a READY high condition, the device appears to com- 
plete the memory cycle by driving DS, PS, IS, or STRB to their inactive states, 
at their normal switching times. The device under emulation is held not ready 
for at least one extra clock cycle or until the memory substitution data is 
available. The memory substitution data is then driven onto the data bus on 
reads while all bus control signals at the target connector are high. 


Note that additional wait states can be added with the use of the target 
READY line. In this case, the memory control lines model the target access 
timing. However, the program cycle count is affected by the additional cycles 
internal to the emulator’s access of the dynamic RAM. Since the system re- 
sponds to the READY line, the target must eventually return a valid READY 
high on each access. 


Miscellaneous Considerations 


When the XDS is powered-up, the device under emulation is placed in the run 
mode with all memory substitution turned off. The control processor does not 
attempt to communicate with the device under emulation until the user com- 
municates with the emulator. If the target system is asserting RS, HOLD, or not 
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READY continuously to the device under emulation, the control processor 
cannot gain control of the device under emulation and reports a /PROCESSOR 
SYNC LOST’ 1160 error. This condition can be caused by a powered-up em- 
ulator plugged into a powered-down target system. Although the RS, HOLD, 
and READY are pulled up with resistors on the emulator, the impedance of the 
powered-down target system can assert a control signal or load the data bus 
so that the XDS cannot function properly. 


The conductive foam on the XDS target cable must be removed along with the 
foam on the logic show pod prior to XDS powerup. Failure to do so can also 
cause the ‘PROCESSOR SYNC LOST’ 1160 error. 


TMS320C25 Designs Using HOLD and HOLDA. When the target system 
asserts HOLD active low while the emulator is processing user-envoked com- 
mands requiring access of the device-under-emulation resources, the target 
will not receive HOLDA until the command is complete. 


When interfacing to dynamic RAM in the target system, the user should use 
READY rather than HOLD to insert refresh cycles. A user-invoked command 
could hold off HOLDA long:enough to lose charge in the dynamic cells. Like- 
wise, the refresh cycle in a "RAS ONLY REFRESH’ system could conflict with 
the emulator system controlling addressing during command processing if the 
address lines to the DRAMs are not buffered. 


Stack Usage. An interrupt is used to halt the device being emulated, thereby 
using one of the emulated device stack locations. When an XDS is to be used, 
the applications programmer should reserve one level of the stack for code 
development. 


Transmission Line Phenomena. Since the XDS target cable is approxi- 
mately 20 inches, use of advanced CMOS or fast/advanced Schottky TTL may 
cause line reflections (ringing above input thresholds) on input lines to the 
XDS. Series termination resistors (22 to 68 ohms) can help to eliminate this 
problem. In some cases where significant additional signal length is added to 
XDS outputs, the series resistors on the XDS may not be sufficient to control 
reflections. !n this case, additional corrective actions may be necessary. 


Clock Source. The XDS does not support the use of a crysta! in the target 
system. The emulator’s clock source can be selected from three sources: 


1)  Acclock (with TTL levels) driven up the target cable on pin F11 (PGA) 
or pin 35 (PLCC), ; 

2) A-socketed changeable crystal on the emulator board (Y1), or 

3) Asocketed changeable canned TTL oscillator on the EMU (U9). 


TMS32020/TMS320C25. The XDS supports both the TMS32020 and 
TMS320C25. The operating frequencies are 20 MHz and 40 MHz, respec- 
tively. The unit is shipped configured as a TMS320C25 emulator, but it can 
easily be converted to a TMS32020 emulator by replacing the TMS320C25 
device on the emulator with the TMS32020 device found in the spare parts 
kit. The crystal, TTL oscillator, and/or input clock frequency must be adjusted 
to correspond to TMS32020 specifications. See Section 9 in the 7MS320C2x 
XDS User's Guide for additional timing and loading information. 
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6.2 Interfacing Memories 


The TMS320C2x can be interfaced with PROMs, EPROMs, and static RAMs. 
The speed, cost, and power limitations imposed by a particular application 
determine the selection of a specific memory device. If speed and maximum 
throughput are desired, the TMS320C2x can run with no wait states. In this 
case, memory accesses are performed in a single machine cycle. Alternatively, 
slower memories can be accessed by introducing an appropriate number of 
wait states or slowing down the system clock. The latter approach is more 
appropriate when interfacing to memories with access times slightly longer 
than those required by the TMS320C2x at full speed. 


When wait states are required, the number of wait states depends on the me- 
mory access time (see Section 6.2.3). With no wait states, the READY input 
to the TMS320C2x can be pulled high. If one or more wait states are required, 
the READY input must be driven low during the cycles in which the 
TMS320C2x enters a wait state. 


The TMS320C2x implements two separate and distinct memory spaces: pro- 
gram space (64K words) and data space (64K words). Distinction between 
the two spaces is made through the use of the PS (program space) and DS 
(data space) pins. A third space, the I/O space, is also available for interfacing 
with peripherals. This space is selected by the IS (1/O space) pin, and is dis- 
cussed in Section 6.5. 


The following brief discussion describes the TMS320C2x read and write cy- 
cles. For the memory read and write timing diagrams, refer to the TMS320C2x 
Data Sheet of Appendix A. For further information about read and write op- 
eration, see Sections 3.7.3 and 3.7.4. Throughout this section, Q is used to 
indicate the duration of a quarter phase of the output clock (CLKOUT1 or 
CLKOUT2). Memory interfaces discussed in this section assume that the 
TMS320C2x is running at 40 MHz; i.e., Q = 25 ns. 


In a read cycle, the following sequence occurs: 


1) Near the beginning of the machine cycle (CLKOUT1 goes low), the ad- 
dress bus and one of the memory select signals (PS, DS, or IS) becomes 
valid. R/W goes high to indicate a read cycle. 


2) STRB goes low no less than tgy(ay = Q-12 ns after the address bus is 
valid. 


3) Early in the second half of the cycle, the READY input is sampled. 
READY must be stable (low or high) at the TMS320C25 no later than 
ta(SL-R) = Q-20 ns after STRB goes low. 


4) With no wait states (READY is high), data must be available no later 
than ta(s_L) = 20-23 ns after STRB goes low. 


The sequence of events that occurs during an external write cycle is the same 
as the above, with the following differences: 


1) | R/W goes Iow to indicate a write cycle. 


2) The data bus begins to be driven approximately concurrently with STRB 
going low. 
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3) The data bus enters a high-impedance state no later than tgis(p) = Q+15 
ns after STRB goes high. 


6.2.1 Interfacing PROMs 


Program memory in a TMS320C2x system can be implemented through the 
use of PROMs. Two different approaches for interfacing PROMs to the 
TMS320C2x can be taken depending on whether or not any of the memories 
in the system require wait states. When no wait states are required for any of 
the memories, READY can be tied high, and the interface to the PROMs be- 
comes a direct connection. In this first approach, address decoding is not re- 
quired since the system contains only a small amount of one type of memory. 
When some of the system memories require wait states, address decoding 
must be performed to distinguish between two or more memory types with 
different access times. In the second approach, a valid READY signal that 
meets the TMS320C2x timing requirements must be provided. An efficient 
method of accomplishing this is to use one section of circuitry to generate the 
address decode, and a second, independent section to generate the READY 
signal. These two approaches are discussed in this section. For more detailed 
information, see Hardware Interfacing to the TMS320C25. 


An example of a no-wait-state memory system is the direct PROM interface 
design shown in Figure 6-5. In this design, the TMS320C25 is interfaced with 
the Texas Instruments TBP38L165-35, a low-power 2K x 8-bit PROM. The 
interface timing for the design of Figure 6-5 is shown in Figure 6-6. The same 
techniques can be used with both of the TMS320C2x devices. The 
TMS320C25 has been chosen for the following examples because it has the 
most stringent timing requirements. 


The TMS320C25 expects data to be valid no later than 20-23 ns after STRB 
goes low. (This is 27 ns for a TMS320C25 operating at 40 MHz.) The access 
times of the TBP38L165-35 are 35 ns maximum from address taay, and 20 
ns maximum from chip enable tais). On the TMS320C25, address becomes ~ 
valid a minimum of ts, = Q-12 ns = 13 ns before STRB goes low. Therefore, 
the data appears on the data bus within 27 ns after STRB goes low, as required 
by the TMS320C25. 


When a read cycle is followed by a write cycle, care must be taken to avoid 
bus conflict. Bus conflict also may occur when a TMS320C25 write cycle is 
followed by a memory read cycle. In this case, the TMS320C25 data lines 
must enter a high-impedance state before the memory starts driving the data 
bus. 
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Figure 6-5. Direct Interface of TBP38L165-35 to TMS320C25 


Hardware Applications - Interfacing Memories 


aout KN 


XX 
WKY 


D15-DO 1 DATAIN |! 


Figure 6-6. Interface Timing of TBP38L165-35 to TMS320C25 


The most critical timing parameters of the TBP38L165-35 direct interface to 
the TMS320C25 are summarized in Table 6-1. 


Table 6-1. Timing Parameters of TBP38L165-35 Direct Interface to 
TMS320C25 


DESCRIPTION SYMBOL USED IN VALUE 
FIGURE 6-6 


Address setup time tsy 13 ns (min) 
TMP38L165-35 access time from chip enable tas) 20 ns (max) 
TMP38L165-35 disable time tdis 15 ns (max) 


The second design example considers the interface of PROMs to the 
TMS320C25 using address decoding. An approach that can be used to meet 
the READY timing requirements is shown in Figure 6-7. This design utilizes 
one address decoding scheme to generate READY, and a second address de- 
coding scheme to enable the different memory banks. In this design, the 
memories with no wait states are mapped at the upper half (upper 32K) of the 
program space. The lower half is used for memories with one or more wait 
states. This decoding is implemented with the 74AS20 four-input NAND gate. 


Address decoding is implemented by the 74AS138. This decoding separates 
the program space into eight segments of 8K words each. The first four of 
these segments (lower 32K of address space) are enabled by the YO, YT, Y2, 
and Y3 outputs of the 74AS138. These segments are used for memories with 
One or more wait states. The other four segments select memories with no wait 
states (the TBP38L165s are mapped in segment 5, starting at address >8000). 
Note that in Figure 6-7, R/W is used to enable the 74AS138. This prevents a 
bus conflict from occurring if an attempt is made to write to the PROMs. Fig- 
ure 6-8 shows the timing for the circuit shown in Figure 6-7. READY goes 
high 10 ns (worst case) after the address has become valid. 


a Aah e co es 
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Figure 6-7. Interface of TBP38L165-35 to TMS320C25 
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Figure 6-8. Interface Timing of TBP38L165-35 to TMS320C25 (Address 
Decoding) 


The most critical timing parameters of the TBP38L165-35 interface with ad- 
dress decoding to the TMS320C25 are summarized in Table 6-2. 


Table 6-2. Timing Parameters of TBP38L165-35 to TMS320C25 
(Address Decoding) 


DESCRIPTION SYMBOL USED IN VALUE 
; FIGURE 6-8 


Propagation delay through the 74AS04 5 ns (max) 


Propagation delay through the 74AS138 10 ns (max) 
Address valid to READY 10 ns (max) 
TBP38L165-35 disable time i 15 ns (max) 
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6.2.2 Wait-State Generator 


The READY input of the TMS320C2x allows the capability to interface with 
memory and peripherals that cannot be accessed in a single cycle. The num- 
ber of cycles in a memory or I/O access is determined by the state of the 
READY input. If READY is high when the TMS320C2x samples the READY 
input, the memory access ends at the next falling edge of CLKOUT1. If READY 
is low, the memory cycle is extended by one machine cycle, and all other sig- 
nals remain valid. Figure 6-9 shows a one-wait state memory access. Note that 
for on-chip program and data memory accesses, the READY input is ignored. 
Refer to Hardware Interfacing to the TMS320C25 for detailed information re- 
garding wait-state generation. 


The automatic generation of one wait state can be accomplished by the use 
of the MicroState Complete (MSC) signal. The MSC output is asserted low 
during CLKOUT1 low to indicate the beginning of an internal or external 
memory or I/O operation (see Figure 6-9). By gating MSC with the address 
and PS, DS, and/or IS, a one-wait READY signal can be generated. 


An alternative approach for the generation of wait states when interfacing with 
memories and peripherals consists of a wait-state generator. In this design, 
READY must be valid (low or high) no later than Q-20 ns = 5 ns after STRB 
goes low. If READY is high, then the memory/peripheral access is completed 
with the present machine cycle. If READY is low, the access is extended to the 
next machine cycle; i.e., a wait state is introduced. The number of wait states 
required depends on the access time tg of the particular memory device or 


- peripheral. If tz; < 40 ns, no wait states are required. If 40 ns < ta < 140 ns, 


one wait state must be inserted. In general, N wait states are required for a 
particular access if 


TMS32020: [200(N-1) + 85]ns < ty < [200N + 85]ns 
TMS320C25: [100(N-1) + 40]ns < tg < [100N + 40]ns 
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Figure 6-9. One Wait-State Memory Access Timing 


The information on the number of wait states required for a memory or pe- 
ripheral access is summarized in Table 6-3. 


Table 6-3. Wait States Required for Memory/Peripheral Access 


NUMBER OF WAIT TMS32020 TMS320C25 
STATES REQUIRED ACCESS TIME ACCESS TIME 


tg < 85 ns tg < 40 ns 


85 ns <tg < 285ns | 40 ns < tg < 140 ns 

285 ns < tg < 485 ns] 140 ns < tg < 240 ns 
485 ns < tg < 685 ns] 240 ns < tg < 340 ns 
685 ns < tg < 885 ns{ 340 ns < ty < 440 ns 


A wait-state generator design and timing are shown in Figure 6-10 and Figure 
6-11, respectively. In the case of one wait state, time t; in Figure 6-11 is the 
time from address valid to memory select of the particular device that requires 
the wait state. This corresponds to the propagation delay through the address 
decode logic. For a 74AS138 decoder, t; = 10 ns (max). 


Time to is the time from memory select going low to CLKOUT2 going low. 
to = tp + ty = 11 ns + 20 ns = 31 ns 
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Time tg is the time from CLKOUT2 going low to READY going high. 
tg =19ns+5ns = 24ns 


READY must remain high until it is sampled again, shortly after CLKOUT1 
goes high. In Figure 6-10, READY remains high well after CLKOUT1 goes 
high. At the falling edge of CLKOUTZ2, the inputs to the J-K flip-flop are J = 
1 and K = Q = 1, and the flip-flop is in the toggle mode. When CLKOUT2 
goes low, G goes back to logic 1. READY goes low and stays low until one 
of the inputs of the 74AS30 is pulled low. 


To implement two wait states, a second J-K flip-flop is utilized as shown in 
Figure 6-10. This delays READY going high by an additional machine cycle 
(see Figure 6-11). If more wait states are required, additional J-K flip-flops 
must be included in the wait-state generator design. 


% T4ALS20A 


READY 
TO 
TMS320C25 
FROM 
TMS320C25: 


CLKOUT2 
RS 


T Connections to other devices in the system that require two wait states. (Inputs not used by other 


devices should be pulled up.) 


+ Connections to other devices in the system that require one wait state. (Inputs not used by other 


devices should be pulled up.) 


§ Connections to other devices in the system that require zero wait states. (Inputs not used by other 


devices should be pulled up.) 


Figure 6-10. Wait-State Generator Design 
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Figure 6-11. Wait-State Generator Timing 


6.2.3 Interfacing EPROMs 


EPROMs can be a valuable tool for debugging TMS320C2x algorithms during 
the prototyping stages of a design, and may even be desirable for production. 
Two different EPROM interfaces to the TMS320C2x. are discussed: a direct 
interface of an EPROM that requires no wait states, and EPROM interfaces 
that require one and two wait states. 


A direct interface similar to that used for PROMs may be implemented when 
EPROM access time meets the TMS320C2x timing specifications. A Texas 
Instruments TMS27C292-35 2K x &-bit EPROM can interface directly to the 
TMS320C25 with no wait states. The TMS27C292-35 is a CMOS EPROM 
with access times of 35 ns from valid address and 25 ns from chip select. 


When slower, less costly EPROMs are used, a simple flip-flop circuit (see 
Section 6.2.2 for wait-state generator design) can be used to generate one or 
more wait states. Figure 6-12 shows an EPROM interface with one wait state, 
where Wafer Scale WS57C64F-12 8K x 8-bit EPROMs are interfaced to the 
TMS320C25. The WS57C64F-12 is the slowest member of the WS57C64F 
EPROM series, but still meets the specifications for one wait state. With 
slower EPROMs, however, data output turnoff can be slow, and must be taken 
into consideration in the design. The WS57C64F-12s are mapped at address 
>2000. Figure 6-13 provides ihe interface timing diagram. 
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Figure 6-12. Interface of WS57C65F-12 to TMS320C25 
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Figure 6-13. Interface Timing of WS57C65F-12 to TMS320C25 


Table 6-4 summarizes the most critical timing parameters of the 
WS57C64F-12 interface to the TMS320C25. 


Table 6-4. Timing Parameters of WS57C64F-12 Interface to 


TMS320C25 
DESCRIPTION SYMBOL USED IN VALUE 
FIGURE 6-13 
Address valid to MEMSEL low ty 10 ns (max) 


STRB low to DTSTR low 

TMS320C25 address valid to WS57C64F-12 
data valid 

STRB high to WS57C64F-12 output disable 


5.8 ns (max) 
130 ns (max) 


40.8 ns (max) 


An EPROM interface with two wait states is shown in Figure 6-14, in which 
the TMS27C64-20 is interfaced to the TMS320C25. The TMS27C64-20 is a 
CMOS 8K x 8-bit EPROM with an access time of 200 ns. The timing diagram 
is shown in Figure 6-15. 
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Figure 6-14. Interface of TMS27C64-20 to TMS320C25 
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Figure 6-15. Interface Timing of TMS27C64-20 to TMS320C25 


Table 6-5 summarizes the most critical timing parameters of the 
TMS27C64-20 interface to the TMS320C25. 


Table 6-5. Timing Parameters of TMS27C64-20 Interface to 
TMS320C25 


Address valid to MEMSEL low 10 ns (max) 


STRB low to DTSTR low 5.8 ns (max) 

TMS320C25 address valid to TMS27C64-20 220 ns (max) 
data valid 

STRB high to TMS27C64-20 output disable 18.8 ns (max) 


For detailed information regarding EPROM interfacing, see the apaplication 
report, Hardware Interfacing to the TMS320C25. 
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6.2.4 Interfacing Static RAMs 
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Interfacing external RAM to the TMS320C2x can be useful for expanding in- 
ternal data memory or implementing additional RAM program memory. Static 
RAM can be used as data memory to extend the TMS320C2x 544-word in- 
ternal RAM. When used as program memory, object code can be downloaded 
into the RAM and executed. In the first case, the static RAM is mapped into 
the data space, while in the second case it is mapped into the program space. 


In cases where RAMs of different speeds are used, separate schemes for ad- 
dress decoding and READY generation can be used to meet READY timing 
requirements in a similar manner to that used for the PROM interface de- 
scribed in Section 6.2.1. RAMs with similar access times may then be grouped 
together in one segment of memory. 


The static RAM for this interface is the Cypress Semiconductor CY7C169-25 
4K x 4-bit static RAM. This RAM has a 25-ns access time from address ta) 
and a 15-ns access time from chip enable ta(ce). Note that these access times 
are fast enough so that a wait-state generator is not required for this interface. 
If, however, RAMs that require wait states are used in the system, the wait- 
state generator described in Section 6.2.2 can be used. . 


The design shown in Figure 6-16 utilizes a similar approach to the one de- 


scribed in Sections 6.2.1 and 6.2.3; i.e., one address decoding scheme is used 


to generate READY, and a second address decoding scheme enables the static 
RAM. In this design, RAMs with no wait states are mapped at the lower half 
(lower 32K words) of the TMS320C25 data space. The upper half is used for 
memories with one or more wait states. Figure 6-17 shows the timing for 
memory read and write cycles. 


Table 6-6 summarizes the most critical timing parameters of the CY7C169-25 
interface to the TMS320C25. 


Table 6-6. Timing Parameters of CY7C169-25 Interface to 
TMS320C25 


DESCRIPTION SYMBOL USED IN VALUE 
FIGURE 6-17 


Address valid to READY valid 10.8 ns (max) 

STRB low to MEMSEL low 8.5 ns (max) 

STRB high to MEMSEL high 7.5 ns (max) 

CLKOUT1 low to TMS320C25 data bus 15 ns (max) 
entering the high-impedance state 


MEMSEL low to CY7C169-25 driving the 5 ns (min) 
data bus 

MEMSEL low to CY7C169-25 data valid 15 ns (max) 

MEMSEL high to CY7C169-25 entering 15 ns (max) 
the high-impedance state 

Data setup time for a write 32 ns (min) 

Data hold time 7.5 ns (min) 
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Figure 6-16. Interface of CY7C169-25 to TMS320C25 
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Figure 6-17. Interface Timing of CY7C169-25 to TMS320C25 
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6.2.5 Interface Timing Analysis 


When interpreting TMS320C25 timing specifications, particularly in the area 
of memory interface timing, it is necessary to understand clock input and clock 
timing relationships shown in timing diagrams as compared with the actual 
data sheet specifications. If interpreted incorrectly, the specifications may 
suggest that interfacing to the device is more constrained than necessary. 
Without exception, the TMS320C25 meets every specification given in the 
data sheet (Appendix A). Some timings are specified more conservatively than 
others, due to yield distributions, etc., but at the minimum stated, each 
TMS320C25 is guaranteed by Texas Instruments to conform explicitly to the 
data sheet timings. 


Clock input and internal clock timing relationships must be considered in the 
interpretation of output timing characteristics and requirements. At the clock 
input to the device, only the rising edges of the clock are used to initiate 
transitions on internal clocks and output signals. Thus, with an input clock of 
a stable frequency (regardless of duty cycle variation within specifications), 
extremely symmetric timing is exhibited throughout the device. A significant 
consequence of this is that CLKOUT1, CLKOUT2, and STRB timing skews 
(with respect to each other) and high and low pulse widths are integer mul- 
tiples of Q (the input clock period or one-fourth of the output clock period) 
to within a few nanoseconds. This occurs because transitions on the output 
signals are initiated directly from the internal clocks (Q1-Q4), and driven 
through identical output buffer circuits. Since the internal clocks are very 
symmetric, close tracking of these outputs results. The large skews in these 
timings, as shown in the data sheet, are a factor of temperature and process. 
Since there is no variation in process and negligible variation in temperature 
across a single device, the skew of the outputs relative to the inputs is con- 
sistent for all outputs. Regardless of the magnitude of such skews, interfaces 
to the TMS320C25 can be designed independent of these skews in most 
cases. 


Interface timings to be considered include READY, memory read, and MSC 
timings. With regard to READY, there are two pairs of related timings in which 
one timing can be met without the other one being met, with the device still 
guaranteed to function properly. These pairs of timings are tg(SL-R) and 
ty(C2H-R), and th(SL-R) and t,}(C2H-R). These front-end and back-end 
READY timings are specified with respect to STRB and CLKOUTZ2. For zero 
wait-state accesses, READY is referenced to STRB, but for wait-state accesses, 
STRB remains low and another timing reference is required. Note that the ac- 
tual timings for each of these parameter pairs are identical and the timings with 
respect to CLKOUT2 and STRB are equivalent. Therefore, if READY timing 
meets the requirements with respect to one of these references (but not nec- 
essarily the other), the timing requirements of the device are satisfied regard- 
less of the actual skews between the two signals. For the purpose of interface 
timing, tg(C2-S) can be assumed to be 0 ns with respect to other signals on 
the TMS320C25. The same is also true of tg(C1-S) and ty(SL); these timings 
can be assumed to be Q and 2Q, respectively. These relationships are ac- 
counted for in specifications and device testing. 


In memory read operations, the two key timings, ta(A) and ts,(D)R, are related 
by ta(A) = tsy(A) + tw(SL) - tsy(D)R. However, when the worst case tw(SL) 
specifications are used in this equation to generate an expression for ta(A), the 
result differs from the specification for ts(A) in the data sheet. Both the spec- 
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ification for ta(A) and ts,(D)R are tested explicitly on the device and guaran- 
teed. This again justifies the assumption that ty(SL) can be assumed to be 
2Q with respect to other signals on the device. This is confirmed by the fact 
that if ty(SL) = 2Q is used to calculate t,(A), consistency results in all of 
these related timings. If an interface is designed where tsy(D)R is met but 
ta(A) is not met due to actual signal skews, the interface is still guaranteed to 
function with the TMS320C25. The same is true (but not as likely in reality) 
if an interface is designed where tg(A) is met but t,,(D)R is not. Thus, even 
if tw(SL) is actually less than 20, meeting either t3(A) or tg,(D)R is still suf- 
ficent to guarantee a valid memory cycle since both parameters are guaranteed 
independently. 


Note that when considered in the absolute sense, timings such as ty(SL) will 
have some finite tolerance, although considerably less than that specified. For 
example, if STRB is used to generate a WE pulse for a device that specifies a 
minimum WE low pulse width, the data sheet specification for STRB low pulse 
width must be used for a worst-case design. 


With regard to MSC timing, the t,(C2H-R) timing is a constraint that must be 
satisfied, and the tg(MSC) is a parameter more conservatively specified than 
many other timings. When considering these timing parameters and 
CLKOUT1/CLKOUT2 skews, the MSC does not meet worst-case timings for 
generating READY, the purpose for which the MSC signal was intended. The 
READY timing will be met by MSC, however, regardless of when MSC goes 
high. This timing is explicitly guaranteed by t,(M-R) = 0, even though MSC 
exhibits some finite skew from CLKOUT1. 
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6.3 Direct Memory Access (DMA) 


Some advanced hardware design concepts supported by the TMS320C2x in- 
clude Direct Memory Access (DMA) and global memory (see Section 6.4). 
Direct memory access can be used for multiprocessing by temporarily halting 
the execution of one or more processors to allow another processor to read 
from or write to the halted processor’s local off-chip memory. Direct memory 
access to external program/data memory is performed using the HOLD and 
HOLDA signals. 


The multiprocessing is typically a master-slave configuration. The master may 
initialize a slave by downloading a program into its program memory space 
and/or provide the slave with the necessary data to complete a task. In a 
typical TMS320C2x direct memory access scheme, the master may be a gen- 
eral-purpose CPU, another TMS320C2x, or perhaps even an analog-to-digital 
converter. A simple TMS320C2x master-slave configuration is shown in Fig- 
ure 6-18. The master TMS320C2x takes complete control of the slave’s ex- 
ternal memory by asserting HOLD !ow via its external flag (XF). This causes the 
slave to place its address, data, and control lines in a high-impedance state. 
By asserting RS in conjunction with HOLD, the master processor can load the 
slave’s local program memory with the necessary initialization code on reset 
or powerup. The two processors can be synchronized using the SYNC pin to 
make the transfer over the memory bus faster and more efficient. 


After control of the slave’s buses is given up to the master processor, the'slave 
alerts the master of this fact by asserting HOLDA. This signal may be tied to the 
master TMS320C2x’s BIO pin. The slave’s XF pin may be used to indicate to 
the master when it has finished performing its task and needs to be repro- 
grammed or requires additional data to continue processing. In a multiple 
slave configuration, the priority of each slave’s task may be determined by ty- 
ing the slave’s XF signals to the appropriate INT(2-0) pin on the master 
TMS320C2x. . 
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TMS320C 2x TMS320C2x 
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Figure 6-18. Direct Memory Access Using a Master-Slave 
Configuration 


A PC environment presents another example of a potential direct memory ac- 
cess scheme where a system bus (the PC-bus) is used for data transfer. In this 
configuration, either the master CPU or a disk controller may place data onto 
the system bus, which can be downloaded into the local memory of the 
TMS320C2x. Here the TMS320C2x acts more like a peripheral processor with 
multifunction capability. In a speech application, for example, the master can 
load the TMS320C2x’s program memory with algorithms to perform such 
tasks as speech analysis, synthesis, or recognition, and fill the TMS320C2x’s 
data memory with the required speech templates. In another application ex- 


“ample, the TMS320C2x can serve as a dedicated graphics engine. Programs 


can be stored in TMS320C2x program ROM or downloaded via the system 
bus into program RAM. Data can come from PC disk storage or provided di- 
rectly by the master CPU. 


Figure 6-19 depicts a direct memory access using a PC environment. In this 
configuration, decode and arbitration logic is used to control the direct mem- 
ory access. When the address on the system bus resides in the local memory 
of the peripheral TMS320C2x, this logic asserts the HOLD signal of the 
TMS320C2x while sending the master a not-ready indication to allow wait 
states. After the TMS320C2x acknowledges the direct memory access by as- 
serting HOLDA, READY is asserted and the information transferred. 
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DATA 


DISK 
CONTROLLER 


DATA 


Figure 6-19. Direct Memory Access in a PC Environment 
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6.4 Global Memory 
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For multiprocessing applications, the external memory of the TMS320C2x can 

be divided into local and global sections. Special registers and pins included 

on the TMS320C2x allow multiple processors to share up to 32K words of 

global data memory space. This implementation facilitates efficient “shared 

data” multiprocessing where data is transferred between two or more proces- 

sors. Unlike a direct memory access (DMA) scheme, reading or writing global . 
memory does not require one of the processors to be halted. 


Global memory can be used in various digital signal processing tasks such as 
filters or modems, where the algorithm being implemented may be divided into 
sections with a distinct processor dedicated to each section. In this multi- 
processor scheme, the first and second processors may share global data 
memory, as well as the second and third, the third and fourth, etc. Arbitration 
logic is required to determine which section of the algorithm is executing and 
which processor has access to the global memory. With multiple processors 
dedicated to distinct sections of the algorithm, throughput may be increased 
via pipelined execution. . 


The size of the global memory is programmable between 256 and 32K lo- 
cations in data memory by loading the global register (GREG). After global 
memory is defined in the GREG, the TMS320C2x asserts the BR (bus request) 
signal before each global memory access. The processor then inserts wait 
states until a bus grant is given by asserting the READY line. Figure 6-20 il- 
lustrates such a global memory interface. Since the processors can be syn- 
chronized by using the SYNC pin, the arbitration logic may be simplified and 
the address and data bus transfers more efficient (see Section 3.10.1 for in- 
formation on synchronization). 


The SYNC pin on the TMS320C2x may also be used to synchronize several 
processors to allow for execution of redundant fail-safe systems. SYNC permits 
instruction broadcasting between several processors and lock-step execution 
after initial synchronization. 
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Figure 6-20. Global Memory Communication 
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6.5 Interfacing Peripherals 


Most DSP systems implement some amount of I/O using peripherals in addi- 
tion to any memory included in the system. This usually includes analog input 
and output, which can be performed through the parallel and serial 1/O ports 
on the TMS320C2x. 


When accessing the external parallel 1/O ports, the access to the data bus is 
multiplexed over the same pins as for a program/data memory access. The 1/0 
space is selected by the IS signal going active Jow, and the address of the port 
is placed on address bits A3-AO. Address bits A15-A4 are held low. 


This section describes hardware interfaces to a TCM29C16 combo-codec, a 
TLC32040 analog interface circuit (AIC), a digital-to-analog (D/A) converter, 
and an analog-to-digital (A/D). 


6.5.1 Combo-Codec Interface 
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Some areas of speech, telecommunications, and many other applications re- 
quire low-cost analog-to-digital (A/D) and digital-to-analog (D/A) convert- 
ers. Combo-codecs are most effective in serving DSP system data-conversion 
requirements. Combo-codecs are single-chip pulse-code-modulated encod- 
ers and decoders (PCM codecs), designed to perform the encoding (A/D 
conversion) and decoding (D/A conversion), as well as the antialiasing and 
smoothing filtering functions. Since combo-codecs perform these functions 
in a single 300-mil DIP package at low cost, they are extremely economical 
for providing system data-conversion functions. 


Combo-codecs interface directly to the TMS320C2x by means of the serial 
port and provide a companded, PCM -coded digital representation of analog 
input samples. This PCM code is easily translated into linear form by the 
TMS320C2x for use in processing. The design discussed here and shown in 
Figure 6-21 uses a Texas Instruments TCM29C16 codec, interfaced using the 
serial port of the TMS320C25. 


The TMS320C2x serial port provides direct synchronous communication with 


serial devices. The interface signals are compatible with codecs and other se- 


rial components so that minimal external hardware is required. Externally, the 
serial port interface is implemented using the following pins. on the 
TMS320C25: 


t DX (transmitted serial data) 

CLKX (transmit clock) 

FSX (transmit framing synchronization signal) 
DR (received serial data) 

CLKR (receive clock) 

e FSR (receive framing synchronization signal) 


Data on DX and DR are clocked by CLKX and CLKR, respectively. These 
clocks are only required during serial transfers on the TMS320C25. On the 
TMS32020, the clocks must be present at all times if the serial port is being 
used. Note that the TMS320C25 is double-buffered. 
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Figure 6-21. Interface of TMS320C25 to TCM29C16 Codec 


Serial port transfers are initiated by framing pulses on the FSX and FSR pins 
for transmit and receive operations, respectively. For transmit operations, the 
FSX pin can be configured as an input or an output. This option is selected 
by the transmit mode (TXM) bit of status register ST1. In this design, FSX is 
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assumed to be configured as an input; therefore, transmit operations are initi- 
ated by a framing pulse on the FSX pin. Upon completion of receive and 
transmit operations, an RINT (serial port receive interrupt) and an XINT (serial 
port transmit interrupt) are generated, respectively. Interface timing of the 
TMS320C25 to the TCM29C16 corresponds to the burst-mode serial port 
transmit and receive operations shown in Figure 3-28 and Figure 3-29, re- 
spectively. Continuous-mode operation using framing pulses or without 
framing pulses is also possible. 


The format (FO) bit of status register ST1 is used to select the format (8-bit 
byte or 16-bit word) of the data to be received or transmitted. For interfacing 
the TMS320C25 to a codec, the format bit should be set to 1, formatting the 
data in 8-bit bytes. 


The TMS320C25 interfaces directly to the codec, as shown in Figure 6-21, 
with no additional logic required. The PCM u-law data generated by the codec 
at the PCMOUT pin is read by the TMS320C25 from the data receive (DR) 
pin, which is internally connected to the receive serial register (RSR). The data 
transmitted from the data transmit (DX) pin of the TMS320C25 is received 
by the PCMIN input of the codec. During the digital-to-analog conversion, 
this p-law companded data must be converted back to a linear representation 
for use in the TMS320C25. The resulting analog waveform is lowpass-filtered 
by the codec’s internal smoothing filter. Therefore, no additional filtering is 
required at the codec output (PWRO+). Software companding routines ap- 
propriate for use on the TMS320C25 are provided in the book, Digital Signal 
Processing Applications with the TMS320 Family. 


The software required to initialize the TMS320C25-codec interface is provided 
in the Combo-Codec Interface section of the application report, Hardware /In- 
terfacing to the TMS320C25. This report also presents detailed information 
regarding codec interfacing. 


A combo-codec configured in the fixed-data-rate mode requires the following 
external clock signals: 


@ A 2.048-MHz clock to be used as the master clock, and 
@ 8-kHz framing pulses to initialize the data transfers. 


Both of these signals can be derived from the 40.96-MHz system clock with 
appropriate divider circuitry. This is the primary justification for selecting 
40.96-MHz as the system clock frequency. The clock divider circuit consists 
of a 74AS74 D-type flip-flop, 74HC390 decade counter, and 74AS869 8-bit 
up/down counter. The hardware connections between these devices are 
shown in Figure 6-21. 


To generate the 2.048-MHz master clock for the combo-codec, a division by 
20 of the 40.96-MHz system clock is required. The 74HC390 contains on- 
chip two divide-by-2 and two divide-by-5 counters. Since the 74HC390 
cannot be clocked with frequencies above approximately 27 MHz, a 74AS74 
configured as a divide-by-2 of the 40.96-MHz clock is used. 


The 74AS869 is configured to generate the 8-kHz clock pulse (the ripple carry 
output is 2.048 MHz/256 = 8 kHz). This pulse is used by the TMS320C25 
and codec as a framing pulse to initiate data transfers. 


The level of the analog input signal is controlled using the TLO72 opamp 
connected in the inverting configuration (see Figure 6-21). Using the 
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500-kQ potentiometer, the gain of this circuit can be varied from 0 to 5. The 
output of the 0.01-yF coupling capacitor drives the TCM29C16’s internal 
opamp. This opamp is connected in the inverting configuration with unity gain 
(feedback and input impedances having the same value of 100 kQ). 


6.5.2 AIC Interface 


For applications such as modems, speech, control, instrumentation, and ana- 
log interface for DSPs, a complete analog-to-digital (A/D) and digital-to-an- 
alog (D/A) input/output system on a single chip may be desired. The 
TLC32040 analog interface circuit (AIC) integrates on a single monolithic 
CMOS chip a bandpass, switched-capacitor, antialiasing-input filter, 14-bit 
resolution A/D and D/A converters, and a lowpass, switched-capacitor, out- 
put-reconstruction filter. The TLC32040 offers numerous combinations of 
master clock input frequencies and conversion/sampling rates, which can be 
changed via digital processor control. 


Four serial port modes on the TLC32040 allow direct interface to TMUS320C2x 
processors. When the transmit and receive sections of the AIC are operating 
synchronously, it can interface to two SN54299 or SN74299 serial-to- parallel 
shift registers. These shift registers can then interface in parallel to the 
TMS320C2x, other TMS320 digital signal processors, or to external FIFO cir- 
cuitry. Output data pulses are emitted to inform the processor that data 
transmission is complete or to allow the DSP to differentiate between two 
transmitted bytes. A flexible control scheme is provided so that the functions 
of the AIC can be selected and adjusted coincidentally with signal processing 
via software control. Refer to the TLC32040 data sheet for detailed informa- 
tion on timing and device functions. 


The AIC is easily interfaced to the TMS320C2x serial ports, as shown in Figure 
6-22. The TMS320C2x can communicate with the AIC either synchronously 
or asynchronously depending on the information in the control register. The 
operating sequence for synchronous communication with the TMS320C2x, 
shown in Figure 6-23, is as follows: 


1) The FSX or FSR pin is brought low. 

2) One 16-bit word is transmitted or one 16-bit byte is received. 
3) The FSX or FSR pin is brought high. 

4) The EODX or EODR pin emits a low-going pulse. 


For asynchronous communication, the operating sequence is similar, but FSX 
and FSR do not occur at the same time (see Figure 6-24). For proper opera- 
tion, the TXM bit in the TMS320C2x control register should be set to 0 so that 
the FSX pin of the TMS320C2x is configured as an input, the format (FO) 
status bit is set to 0, and the AIC WORD/BYTE pin is at logic high. After each 
receive and transmit operation, the TMS320C2x asserts an internal receive 
(RINT) and transmit (XINT) interrupt, which may be used to control program 
execution. 
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Figure 6-22. Interface of TLC32040 to TMS320C2x 
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Figure 6-23. Synchronous Timing of TLC32040 to TMS320C2x 
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Figure 6-24. Asynchronous Timing of TLC32040 to TMS320C2x 


For further information regarding the AIC interface, see page 11-196 of Linear 
and Interface Circuits Applications, Volume 3: Peripheral Drivers, Data Acqui- 
sition Systems, Hall-Effect Devices, published by Texas Instruments. 
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6.5.3 Digital-to-Analog (D/A) interface 


TM4S32020 


V, 


The high-speed operation of the internal logic circuitry of the TLC7524 8-bit 
digital-to-analog (D/A) converter allows an interface to the TMS32020 with 
a minimum of external circuitry required. Figure 6-25 shows the interface cir- 
cuitry, which consists of one SN74ALS138 3-to-8-line decoder used to de- 
code the address of the peripheral. 


74ALS138 


AS Viet 338 where D = digital input 


Figure 6-25. Interface of TLC7524 to TMS32020 


When the TMS32020 executes an OUT instruction (see Figure 6-28), the 
peripheral address is placed on the address bus and the IS line goes low, in- 
dicating that the address on the bus corresponds to an 1|/O port and not ex- 
ternal data or program memory. A low level at IS enables the 74ALS138 
decoder, and the Y-output, corresponding to the address on the bus, is 
brought low. When the Y-output is brought low, the TLC7524 is enabled and 
the data appearing on the data bus is latched into the D/A converter by 
STRB. The controlling software for the D/A interface is given on page 11-204 
of Linear and Interface Circuits Applications, Volume 3: Peripheral Drivers, 
Data Acquisition Systems, Hall-Effect Devices, published by Texas Instru- 
ments. 
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Figure 6-26. Interface Timing of TLC7524 to TMS$32020 


6.5.4 Analog-to-Digital (A/D) Interface 
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The TMS320C2x can be interfaced to 8-bit A/D converters, such as the 
TLCO820. However, because the control circuitry of the TLCO820 operates 
much more slowly than the TMS320C2x, it cannot be directly interfaced. In 
the TLC0820 to TMS32020 interface design shown in Figure 6-27, the fol- 


lowing logic devices are used in the interface circuit: 


A 3-line to 8-line decoder (SN74ALS1 38) 
A quad 2-input NAND gate (SN74LS00) 
A hex inverter (SN74LS04) 

A quad 2-input OR gate (SN74LS32) 

A quad D-type flip-flop (SN74LS175) 
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Figure 6-27. Interface of TLC0820 to TMS32020 


The 74LS138 decodes the addresses assigned to the TLCO820. One of the 
addresses is used when performing a write operation; the other is used for a 
read operation. The two different addresses are necessary to ensure that the 
correct number of wait states is provided for the write and read operations. 
The controlling software for the A/D interface is given on page 11-206 of 
Lin-ear and Interface Circuits Applications, Volume 3: Peripheral Drivers, Data 
Acquisition Systems, Hall-Effect Devices, published by Texas Instruments. 


With the TMS32020 running at 20 MHz and the TLCO820 configured as slow 
memory, three wait states are necessary to provide a write pulse of sufficient 

_ length. After conversion has begun (with the rising edge of the WR signal), the 
TMS32020 must wait at least 600 ns before the conversion result can be read. 
Sufficient delay should be provided in software. To read the conversion result, 
sufficient wait states must be provided to allow for the data access time (320 
ns minimum) of the TLCOQ820. As shown in the IN instruction timing diagram 
of Figure 6-28, two wait states are provided when accessing port 1. 
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Figure 6-28. Interface Timing of TLC0820 to TMS32020 
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6.5.5. 1/O Ports 


1/O design on the TMS320C2x is treated the same way as memory. The I/O 
address space is distinguished from the local program/data memory space by 
the IS signal. IS goes low at the beginning of the memory cycle. All other 
control signals and timing parameters are the same as those for the 
program/data external memory interface. 


The TMS320C2x software instructions can access 16 input and 16 output 
ports. The four least significant bits of the address bus specify the particular 
port being accessed. A pair of 74AS138s can be used to fully decode these 
address bits (see Figure 6-29). 
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Figure 6-29. I/O Port Addressing 


A simple interface between two processors can be implemented using up to 
16 bidirectional 1/O ports connected to the TMS320C2x. An interprocessor 
communication path can be formed by memory-mapping peripherals to the 
1/O ports of the TMS320C2x. In this manner, the TMS320C2x can connect 
to parallel A/Ds, registers, FIFOs, two-port memories, or other peripheral de- 
vices. In a multiprocessing scheme, intelligent peripherals can be memory- 
mapped into the I/O ports. Here the TMS320C2x can communicate with 
UARTs, general-purpose microprocessors, disk controllers, video controllers, 
or other peripheral processors. 


Using an 8-bit general-purpose microprocessor, such as TI’s TMS70C42, for 
a keyboard interface is an example of a TMS320C2x I/O port multiprocessing 
scheme, as shown in Figure 6-30. The TMS70C42 may be mapped into the 
TMS320C2x |/O space using latches to store the transferred data. In a single 
or multiple 1/O port multiprocessing configuration, the four LSBs of the ad- 
dress bus are decoded to determine which of the 16 I/O ports on the 
TMS320C2x is being accessed. The TMS320C2x selects the I/O space (IS) 
for its external bus and reads/writes data using the IN/OUT instructions. 


Processor-controlled signals between the TMS320C2x and the peripheral de- 
vice indicate when data is available to be read. This interprocessor commu- 
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nication is facilitated by using the input and output pins of the TMS70C42 (or 
other peripheral processor). In an 1/O multiprocessing configuration, the I/O 
port address space is limited, and data transfers are relatively slow compared 
to a direct memory access or global memory configuration. 


TMS320C 2x TMS70C42 


DECODE 


LoGic > CONTROL 


PINS 


PROGRAM 
MEMORY 


Figure 6-30. !/O Port Processor-to-Processor Communication 
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6.6 System Applications 


The TMS320C2x is used in a wide variety of systems. Several applications in 
the areas of telecommunications, graphics and image processing, high-speed 
control, instrumentation, and numeric processing are described in the follow- 
ing paragraphs to illustrate basic approaches to system design using the 
TMS320C2x. 


6.6.1 Echo Cancellation 


Digital signal processing is extensively used in telecommunications applica- 
tions. !n echo cancellation, an adaptive FIR filter performs the modelling 
routine and signal modifications required to adaptively cancel the echo caused 
by impedance mismatches in telephone transmission lines. The TMS320C25’s 
large on-chip RAM of 544 words and on-chip ROM of 4K words allow it to 
execute a 256-tap adaptive filter (82-ms echo cancellation) without external 
data or program memory. Figure 6-31 shows a common configuration for an 
echo canceller that uses a TCM29C16 codec interface. 


LINE 
INTERFACE 


TCM29C16 
CODEC 


TMS320C25 


Figure 6-31. Echo Canceller 


6.6.2 High-Speed Modem 


In high-speed modems, a signal processor is used to perform functions such 
as modulation/demodulation, adaptive equalization, and echo cancellation. 
The TMS320C2x large memory space allows it to support multiple standards 
such as Bell 103, Bell 212A, V.22 bis, V.29, V.32, and V.33, as well as pro- 
prietary algorithms. The modem shown in Figure 6-32 consists of the host 
interface, controller, DSP, and analog front-end. 
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Figure 6-32. High-Speed Modem 


6.6.3 Voice Coding 


Voice coding techniques, such as full-duplex 32-kbps adaptive differential 
pulse-code modulation (CCITT G.721), 16-kbps subband coding, and linear 
predictive coding, are frequently used in voice transmission and storage. The 
speed of the TMS320C2x in performing arithmetic computations, normaliza- 
tion, and bit manipulation enables it to implement these functions usually in- 
ternally (i.e., with no external devices). Figure 6-33 shows a voice coding 
system consisting of a TMS320C2x DSP, TCM29C16 codec or TLC32040 
AIC, and optional external memory. 


TCM29C16 
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Figure 6-33. Voice Coding System 
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6.6.4 Graphics and Image Processing 


In graphics and image processing applications, a signal processor's ability to 
interface with a host processor is important. The TMS320C2x multiprocessor 
interface enables it to be used in a variety of host/coprocessor configurations 
(see Figure 6-34 for an example of a graphics system configuration). Graph- 
ics and image processing applications can use the large directly addressable 
external data memory space and global memory capability to allow graphical 
images in memory to be shared with a host processor, thus minimizing data 
transfers. Indexed indirect addressing modes on the TMS320C2x allow ma- 
trices to be processed row-by-row when matrix multiplication is performed for 
3-D image rotation, translation, and scaling. 
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Figure 6-34. Graphics System 


6.6.5 High-Speed Control 


High-speed control applications, such as robotics, use the TMS320C2x gen- 
eral-purpose features for bit manipulation, logical operations, timing synchro- 
nization, and fast data transfers (10 million 16-bit words per second). In 
addition to the numeric-intensive control functions typical of robotic applica- 
tions, the TMS320C2x provides a host interface whereby a robot can com- 
municate to a central host processor (see Figure 6-35). The TMS320C2x is 
also used in the closed-loop systems of disk drives for signal conditioning, 
filtering, high-speed computing, and multichannel multiplexing. 
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Figure 6-35. Robot Axis Controller Subsystem 


6.6.6 Instrumentation and Numeric Processing 


Instrumentation, such as spectrum analyzers, require a large data memory 
space and a processor such as the TMS320C2x capable of performing long- 
length FFTs and generating high-precision functions with minimal external 
hardware. Figure 6-36 shows an example of an instrumentation system. Nu- 
meric processing applications benefit from the high throughput, multiproc- 
essing, and data memory expansion capabilities of the TMS320C2x. 
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Figure 6-36. Instrumentation System 
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TMS320 SECOND-GENERATION 
DIGITAL SIGNAL PROCESSORS 


MAY 1987-—REVISED SEPTEMBER 1987 


100-ns Instruction Cycle Time 


544 Words of Programmable On-Chip Data 
RAM 


4K Words of On-Chip Program ROM 

128K Words of Data/Program Space 

32-Bit ALU/Accumulator 

16 x 16-Bit Multiplier with a 32-Bit Product 


Block Moves for Data/Program Management 


Repeat Instructions for Efficient Use of 
Program Space 


Serial Port for Direct Codec Interface 


Synchronization Input for Synchronous 
Multiprocessor Configurations 


© Wait States for Communication to Slow 
Off-Chip Memories/Peripherals 


On-Chip Timer for Control Operations 
Single 5-V Supply 

Packaging: 68-Pin PGA and 68-Pin PLCC 
Commercial and Military Versions Available 


NMOS Technology: 
— TMS32020 ..... 200-ns cycle time 


® CMOS Technology: 
— TMS320C25 ....100-ns cycle time 


This data sheet provides complete design 
documentation for the second-generation 
devices of the TMS320 family. This facilitates 
the selection of the devices best suited for user 
applications by providing all specifications and 
special features for each TMS320 member. This 
data sheet is divided into four major sections: 
architecture, electrical specifications (NMOS and 
CMOS), timing diagrams, and mechanical data. 
In each of these sections, generic information is 
presented first, followed by specific device 
information. An index is provided for quick 
reference to specific information about a device. 
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TSee Pin Assignments Table and Pin Nomenclature Table 
(Page 2) for location and description of all pins. 


The TMS320 family of 16/32-bit single-chip digital signal processors combines the flexibility of a high- 
speed controller with the numerical capability of an array processor, thereby offering an inexpensive 
alternative to multichip bit-slice processors. The highly paralleled architecture and efficient instruction set 
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PGA/PLCC PIN ASSIGNMENTS 


FUNCTION PIN | FUNCTION FUNCTION PIN | FUNCTION FUNCTION PIN FUNCTION PIN 


A12 K8/40 H1/22 | Vcc H2/23 
A13 19/41 is J11/46| Vcc L6/35 
Al4 K9/42 Mct aest Vss B1/10 
A15 110/43 DS MSC C10/59| Vss K11/44 
Bio 87/68 PS J10/47| Vss 12/27 


BR G11/50 B8/66 XF D11/56 
CLKOUT1 C11/58 RS A8/65 x1 G10/51 
CLKOUT2 D10/57 H11/48| X2/CLKIN F11/52 
CLKR B9/64 H10/49 

CLKX A9/63 F2/19 

DO F1/18 A10/61 

01 E2/17 Ti B10/62 


tOn the TMS32020, MP/MC must be connected to Vcc. 
PIN NOMENCLATURE 


5-V supply pins 
vee Ground pins 
x1 Output from internal oscillator for crystal 
X2/CLKIN Input to internal oscillator from crystal or external clock 
CLKOUT1 Master clock output (crystal or CLKIN frequency/4) 
CLKOUT2 A second clock output signal 
16-bit data bus D15 (MSB) through DO (LSB). Multiplexed between program, data, and 1/O spaces. 
16-bit address bus A15 (MSB) through AO (LSB) 
Program, data, and I/O space select signals 
Read/write signal 
Strobe signal 
Reset input 
External user interrupt inputs 
Microprocessor/microcomputer mode select pin 
Microstate complete signal 
Interrupt acknowledge signal 
Oata ready input. Asserted by external logic when using slower devices to indicate that the current bus 
transaction is complete. 
Bus request signal. Asserted when the TMS320C25 requires access to an external global data memory 
space. 
External flag output (latched software-programmable signal!) 
Hold input. When asserted, TMS320C25 goes into an idle mode and places the data, address, and 
control lines in the high impedance state. 
Hold acknowledge signal 
Synchronization input 
Branch contro! input. Polled by BIOZ instruction. 
Serial data receive input 


Clock for receive input for serial port 

Frame synchronization pulse for receive input 
Serial data transmit output 

Clock for transmit output for serial port 


Frame synchronization pulse for transmit. Configurable as either an input or an output. 


T1O/Z denotes input/output/high-impedance state. 
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provide speed and flexibility to produce a MOS microprocessor family capable of executing 10 MIPS (million 
instructions per second). The TMS320 family optimizes speed by implementing functions in hardware that 
other processors implement through microcode or software. This hardware-intensive approach provides 
the design engineer with processing power previously unavailable on a single chip. 


The TMS320 family consists of three generations of digital signal processors. The first generation contains 
the TMS32010 and its spinoffs. The second generation includes the TMS32020 and TMS320C25, which 
are described in this data sheet. The TMS320C30 is the third-generation processor, designed for higher 
performance. Many features are common among the TMS320 processors. Specific features are added 
in each processor to provide different cost/performance tradeoffs. Software compatibility is maintained 
throughout the family to protect the user’s investment in architecture. Each processor has software and 
hardware tools to facilitate rapid design. 


introduction 


The TMS32010, the first NMOS digital signal processor in the TMS320 family, was introduced in 1983. 
Its powerful instruction set, inherent flexibility, high-speed number-crunching capabilities, and innovative 
architecture have made this high-performance, cost-effective processor the ideal solution to many 
telecommunications, computer, commercial, industrial, and military applications. Since that time, the 
TMS320C10, a low-power CMOS version of the industry-standard TMS32010, and other spinoff devices 
have been added to the first generation of the TMS320 family. 


The second generation of the TMS320 family (referred to as TMS320C2x) includes two members, the 
TMS32020 and the TMS320C25. The architecture of these devices is based upon that of the TMS32010. 


The TMS32020, processed in NMOS technology, is source-code compatible with the TMS32010 and in 
many applications is capable of two times the throughput of the first-generation devices. Its enhanced 
instruction set (109 instructions), large on-chip data memory (544 words), large memory spaces, on-chip 
serial port, and hardware timer make the TMS32020 a powerful addition to the TMS320 family. 


The TMS320C25 is the newest member of the TMS320 second generation. It is processed in CMOS 
technology, is capable of an instruction cycle time of 100 ns, and is pin-for-pin and object-code compatible 
with the TMS32020. The TMS320C25’s enhanced feature set greatly increases the functionality of the 
device over the TMS32020. Enhancements include 24 additional instructions (133 total), eight auxiliary 
registers, an eight-level hardware stack, 4K words of on-chip program ROM, a bit-reversed indexed- 
addressing mode, and the low-power dissipation inherent to the CMOS process. 


Table 1 provides an overview of the second-generation TMS320 processors with comparisons of memory, 
1/O, cycle timing, power, package type, technology, and military support. For specific availability, contact 


the nearest TI sales office. 
TYP PACKAGE 
POWER TYPE 
(mw) { PGA PLCC 


Pizs0 [es | 
[200 [6a 668 | 


TABLE 1. TMS320 SECOND-GENERATION DEVICE OVERVIEW 


a 
DEVICE ON-CHIP OFF-CHIP TIMER 
YES 


TMS32020? (NMOS) 64K 64K | YES 16 x 16 | yes | 200 | 
TMS320C25? (CMOS) | 544 4K | 64k 64k | YES 16x16 CON| YES | 100 | 
TSER = serial; PAR = parallel; DMA = direct memory access; CON = concurrent DMA. 


Military version planned; contact nearest TI sales office for availability. 
§PLCC version planned; contact nearest TI sales office for avaiiability. 
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Key Features: TMS32020 
® 200-ns Instruction Cycle Time 
@ 544 Words of On-Chip Data RAM 


@ 128K Words of Total Data/Program Memory 
Space +5V GND 


© Wait States for Communication to Slower Off- 
Chip Memories 


Object Code Compatible with the TMS32010 


INTERRUPTS 


256-WORD | 288-WORD 
DATA/PROG| DATA 
RAM __! _RAM 


e 
® Single-Cycle Multiply/Accumulate Instructions 
@ Repeat Instructions 

@ Global Data Memory Interface 

@ 

e 


Block Moves for Data/Program Management 


Five Auxiliary Registers with Dedicated 
Arithmetic Unit 


Serial Port for Multiprocessing or Interfacing to 
Codecs, Serial Analog-to-Digital Converters, etc. 


On-Chip Clock Generator 

Single 5-V Supply 

NMOS Technology 

68-Pin Grid Array (PGA) Package 


Key Features: TMS320C25 

® 100-ns Instruction Cycle Time 

® 4K Words of On-Chip Program ROM 
@ 544 Words of On-Chip RAM > 
° 


128K Words of Total Program/Data Memory 
Space 


Wait States for Communications to Slower Off- 
Chip Memories 


Object-Code Compatible with the TMS32020 intenRUPTS [{2ee.wonb | 268-WORD 


® 24 Additional Instructions to Support Adaptive DATA/PROG; DATA 
Filtering, FFTs, and Extended-Precision Arithmetic RAM {| RAM 


© Block Moves for Data/Program Management 
@ Single-Cycle Multiply/Accumulate Instructions 

‘ sis : : f MULTIPLIER 
@ Eight Auxiliary Registers with Dedicated 


Arithmetic Unit 32-BIT ALU/ACC 


@ Bit-Reversed Indexed-Addressing Mode for 


+5 V GND 


Radix-2 FFTs 

Double-Buffered Serial Port | TIMER 
TIMER 

On-Chip Clock Generator 

Single 5-V Supply 


CMOS Technology 


68-Pin Grid Array (PGA) Package and 
68-Lead Chip Carrier (PLCC) Package 
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MULTI- 
PROCESSOR 
INTERFACE 


SERIAL 
INTERFACE 


ADDRESS (16) 


DATA (16) 


MULTI- 
PROCESSOR 
| ——____—i— 
INTERFACE 
SERIAL 
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architecture 


The TMS320 family utilizes a modified Harvard architecture for speed and flexibility. In a strict Harvard 
architecture, program and data memory lie in two separate spaces, permitting a full overlap of instruction 
fetch and execution. The TMS320 family’s modification of the Harvard architecture allows transfers 
between program and data spaces, thereby increasing the flexibility of the device. This modification permits 
coefficients stored in program memory to be read into the RAM, eliminating the need for a separate 
coefficient ROM. It also makes available immediate instructions and subroutines based on computed values. 


Increased throughput on the TMS320C2x devices for many DSP applications is accomplished by means 
of single-cycle multiply/accumulate instructions with a data move option, up to eight auxiliary registers 
with a dedicated arithmetic unit, and faster !/O necessary for data-intensive signal processing. 


The architectural design of the TMS320C2x emphasizes overall speed, communication, and flexibility in 
processor configuration. Control signals and instructions provide floating-point support, block-memory 
transfers, communication to slower off-chip devices, and multiprocessing implementations. 


32-bit ALU/accumulator 


The 32-bit Arithmetic Logic Unit (ALU) and accumulator perform a wide range of arithmetic and logical 
instructions, the majority of which execute in a single clock cycle. The ALU executes a variety of branch 
instructions dependent on the status of the ALU or a single bit in a word. These instructions provide the 
following capabilities: 


¢ Branch to an address specified by the accumulator 
¢ Normalize fixed-point numbers contained in the accumulator 
¢ Test a specified bit of a word in data memory. 


One input to the ALU is always provided from the accumulator, and the other input may be provided from 
the Product Register (PR) of the multiplier or the input scaling shifter which has fetched data from the 
RAM on the data bus. After the ALU has performed the arithmetic or logical operations, the result is stored 
in the accumulator. 


The 32-bit accumulator is split into two 16-bit segments for storage in data memory. Additional shifters 
at the output of the accumulator perform shifts while the data is being transferred to the data bus for 
storage. The contents of the accumulator remain unchanged. 


scaling shifter 


The TMS320C2x scaling shifter has a 16-bit input connected to the data bus and a 32-bit output connected 
to the ALU. The scaling shifter produces a left shift of O to 16 bits on the input data, as programmed 
in the instruction. The LSBs of the output are filled with zeroes, and the MSBs may be either filled with 
zeroes or sign-extended, depending upon the status programmed into the SXM (sign-extension mode) bit 
of status register ST1. 


16 x 16-bit parallel multiplier 


The 16 x 16-bit hardware multiplier is capable of computing a signed or unsigned 32-bit product in a 
single machine cycle. The multiplier has the following two associated registers: 


¢ A 16-bit Temporary Register (TR) that holds one of the operands for the multiplier, and 
¢ A 32-bit Product Register (PR) that holds the product. 


Incorporated into the instruction set are single-cycle multiply/accumulate instructions that allow both 
operands to be processed simultaneously. The data for these operations may reside anywhere in internal 
or external memory, and can be transferred to the multiplier each cycle via the program and data buses. 


Four product shift modes are available at the Product Register (PR) output that are useful when performing 
multiply/accumulate operations, fractional arithmetic, or justifying fractional products. 
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functional block diagram (TMS320C2x) 
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DATA/PROG 
RAM (256 x 16) 
BLOCK BO 


BLOCK B2 
(32 x 16) 


DATA RAM 
BLOCK B1 
(256 x 16) 


SS 


32 


TShifters on TMS32020 (0, 1, 4) 
NOTE: Shaded areas are for TMS320C25 only. 


LEGEND: 
ACCH = Accumulator high {FR == Interrupt flag register PC = Program counter 
ACCL = Accumulator low IMR-_= = Interrupt mask register PFC = Prefetch counter 
ALU == Arithmetic logic unit In = Instruction register RPTC = Repeat instruction counter 
ARAU = Auxiliary register arithmetic unit MCS = Microcall stack GREG = Globe! memory allocation register 
ARB = Auxiliary register pointer buffer QiIR = Queue instruction register RSR = Serial port receive shift register 
ARP = Auzxillary register pointer PR = Product register XSR = Serial port transmit shift register 
oP = Data memory page pointer PRD = Period register for timer ARO-AR? = Auxiliary registers 
ORR = Seria! port data receive register TIM = Timer STO,ST1 = Status registers 
OXR-~ == Serial port data transmit register TR = Temporary register c = Carry bit 
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timer 


The TMS320C2x provides a memory-mapped 16-bit timer for control operations. The on-chip timer (TIM) 
register is a down counter that is continuously clocked by CLKOUT1 on the TMS320C25. The timer is 
clocked by CLKOUT1/4 on the TMS32020. A timer interrupt (TINT) is generated every time the timer 
decrements to zero. The timer is reloaded with the value contained in the period (PRD) register within the 
next cycle after it reaches zero so that interrupts may be programmed to occur at regular intervals of 
PRD + 1 cycles of CLKOUT1 on the TMS320C25 or 4 x PRD x CLKOUT1 cycles on the TMS32020. 


memory control 


The TMS320C2x provides a total of 544 16-bit words of on-chip data RAM, divided into three separate 
blocks (BO, B1, and B2). Of the 544 words, 288 words (blocks B1 and B2) are always data memory, and 
256 words (block BO) are programmable as either data or program memory. A data memory size of 544 
words allows the TMS320C2x to handle a data array of 512 words (256 words if on-chip RAM is used 
for program memory), while still leaving 32 locations for intermediate storage. When using block BO as 
program memory, instructions can be downloaded from external program memory into on-chip RAM and 
then executed. , 


When using on-chip program RAM, ROM, or high-speed external program memory, the TMS320C2x runs 
at full speed without wait states. However, the READY line can be used to interface the TMS320C2x 
to slower, less-expensive external memory. Downloading programs from slow off-chip memory to on-chip 
program RAM speeds processing while cutting system costs. 


The TMS320C2x provides three separate address spaces for program memory, data memory, and I/O. 
The on-chip memory is mapped into either the 64K-word data memory or program memory space, depending 
upon the memory configuration (see Figure 1). The CNFD (configure block BO as data memory) and CNFP 
(configure block BO as program memory) instructions allow dynamic configuration of the memory maps 
through software. Regardless of the configuration, the user may still execute from external program memory. 


The TMS320C2x has six registers that are mapped into the data memory space: a serial port data receive 
register, serial port data transmit register, timer register, period register, interrupt mask register, and global 
memory allocation register. 


interrupts and subroutines 


The TMS320C2x has three external maskable user interrupts INT2-INTO, available for external devices that 
interrupt the processor. Internal interrupts are generated by the serial port (RINT and XINT), by the timer 
(TINT), and by the software interrupt (TRAP) instruction. Interrupts are prioritized with reset (RS) having 
the highest priority and the serial port transmit interrupt (XINT) having the lowest priority. All interrupt 
locations are on two-word boundaries so that branch instructions can be accommodated in those locations 
if desired. 


A built-in mechanism protects multicycle instructions from interrupts. If an interrupt occurs during a 
multicycle instruction, the interrupt is not processed until the instruction is completed. This mechanism 
applies to instructions that are repeated and to instructions that become multicycle due to the READY signal. 


external interface 


The TMS320C2x supports a wide range of system interfacing requirements. Program, data, and I/O address 
spaces provide interface to memory and I/O, thus maximizing system throughput. !/O design is simplified 
by having !/O treated the same way as memory. I/O devices are mapped into the I/O address space using 
the processor’s external address and data buses in the same manner as memory-mapped devices. Interface 
to memory and I/O devices of varying speeds is accomplished by using the READY line. When transactions 
are made with slower devices, the TMS320C2x processor waits until the other device completes its function 
and signals the processor via the READY line. Then, the TMS320C2x continues execution. 
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PROGRAM PROGRAM DATA 
O1>0000) | inTERRUPTS O(>0000) | INTERRUPTS nites ON-CHIP 
AND RESERVED AND RESERVED MEMORY-MAPPED 
(EXTERNAL) (ON-CHIP ROM) REGISTERS 
31(>001F) 31(>001F) 5(>0005) 
32(>0020) 32(>0020) 6(>0006) 


EXTERNAL 


65,535( >FFFF) 


IF MP/MC = 1 


(MICROPROCESSOR MODE) 


65,535(>FFFF) 


ON-CHIP 
ROM 


4015(>O0FAF) 


95(>005F) 


4016(>OFBO) 96(>0060) 
RESERVED 

4095( > OFFF) 127(>007F) 

4096( >1000) 128(>0080) 


EXTERNAL 


IF MP/MC = 0 


(MICROCOMPUTER MODE 


ON TMS320C25 ONLY) 


511(>01FF) 
512(>0200) 


767(>02FF) 
768(>0300) 


1023( >03FF) 
1024(>0400) 


65,535(>FFFF) 


(a) MEMORY MAPS AFTER A CNFD INSTRUCTION 


PROGRAM PROGRAM 
0(>0000) INTERRUPTS O{>0000) INTERRUPTS O0(>0000) 
AND RESERVED AND RESERVED 
(EXTERNAL) (ON-CHIP ROM) 
31(>001F) 31{>001F) 5(>0005) 
ROM 
4015(>OFAF) 95{>O005F) 
4016(>OFBO) 96(>0060) 
RESERVED 
4095( >OFFF) . 127(>007F) 
4096({>1000) 128{>0080) 
511(>01FF) 
EXTERNAL EXTERNAL 512(>0200) 
767(>O2FF) 
768(>0300) 
65,279( >FEFF) 65,279(>FEFF) 1023(>O03FF) 
65,280( >FFOO) ON-CHIP 65,280(>FFOO) ON-CHIP * 1024(>0400) 


BLOCK BO 


65,535( >FFFF) 


IF MP/MC = 1 


(MICROPROCESSOR MODE) 


65,535(>FFFF) 


BLOCK BO 


IF MP/MC = 0 
(MICROCOMPUTER MODE 
ON TMS320C25 ONLY) 


65,535( >FFFF) 


(b) MEMORY MAPS AFTER A CNFP INSTRUCTION 


FIGURE 1. MEMORY MAPS 


ON-CHIP 
BLOCK B2 
RESERVED 

ON-CHIP 
BLOCK BO 

ON-CHIP 
BLOCK Bt 
EXTERNAL 


DATA 
ON-CHIP 


MEMORY-MAPPED 
REGISTERS 


RESERVED 
ON-CHIP 
BLOCK B2 


RESERVED 
DOES NOT 
EXIST 


ON-CHIP 
BLOCK B1 


EXTERNAL 


RESERVED 


PAGE 0 


PAGES 1-3 


PAGES 4-5 


PAGES 6-7 


PAGES 8-511 


PAGE 0 


PAGES 1-3 


PAGES 4-5 


PAGES 6-7 


PAGES 8-511 
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A full-duplex serial port provides communication with serial devices, such as codecs, serial A/D converters, 
and other serial systems. The interface signals are compatible with codecs and many other serial devices 
with a minimum of external hardware. The serial port may also be used for intercommunication between 
processors in multiprocessing applications. 


The serial port has two memory-mapped registers: the data transmit register (DXR) and the data receive 
register (ORR). Both registers operate in either the byte mode or 16-bit word mode, and may be accessed 
in the same manner as any other data memory location. Each register has an external clock, a framing 
synchronization pulse, and associated shift registers. One method of multiprocessing may be implemented 
by programming one device to transmit while the others are in the receive mode. The serial port on the 
TMS320C25 is double-buffered and fully static. 


multiprocessing 


The flexibility of the TMS320C2x allows configurations to satisfy a wide range of system requirements 
and can be used as follows: 


A standalone processor 

A multiprocessor with devices in parallel : 

A slave/host multiprocessor with global memory space 

A peripheral processor interfaced via processor-controlled signals to another device. 


For multiprocessing applications, the TMS320C2x has the capability of allocating global data memory space 
and communicating with that space via the BR (bus request) and READY control signals. Global memory 
is data memory shared by more than one processor. Global data memory access must be arbitrated. The 
8-bit memory-mapped GREG (global memory allocation register) specifies part of the TMS320C2x’s data 
memory as global external memory. The contents of the register determine the size of the global memory 
space. If the current instruction addresses an operand within that space, BR is asserted to request control 
of the bus. The length of the memory cycle is controlled by the READY line. 


The TMS320C2x supports DMA (direct memory access) to its external program/data memory using the 
HOLD and HOLDA signals. Another processor can take complete control of the TMS320C2x’'s external 
memory by asserting HOLD low. This causes the TMS320C2x to place its address, data, and control lines 
in a high-impedance state, and assert HOLDA. On the TMS320C25, program execution from on-chip ROM 
may proceed concurrently when the device is in the hold mode. 


instruction set 


The TMS320C2x microprocessor implements a comprehensive instruction set that supports both numeric- 
intensive signal processing operations as well as general-purpose applications, such as multiprocessing 
and high-speed control. The TMS32020 source code is upward-compatible with TMS320C25 source code. 
TMS32020 object code runs directly on the TMS320C25. 


For maximum throughput, the next instruction is prefetched while the current one is being executed. Since 
the same data lines are used to communicate to external data/program or I/O space, the number of cycles 
may vary depending upon whether the next data operand fetch is from internal or external memory. Highest 
throughput is achieved by maintaining data memory on-chip and using either internal or fast external program 
memory. 


addressing modes 


The TMS320C 2x instruction set provides three memory addressing modes: direct, indirect, and immediate 
addressing. ; 


Both direct and indirect addressing can be used to access data memory. In direct addressing, seven bits 
of the instruction word are concatenated with the nine bits of the data memory page pointer to form the 
16-bit data memory address. Indirect addressing accesses data memory through the auxiliary registers. 
In immediate addressing, the data is based on a portion of the instruction word(s). 
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In direct memory addressing, the instruction word contains the lower seven bits of the data memory address. 
This field is concatenated with the nine bits of the data memory page pointer to form the full 16-bit address. 
Thus, memory is paged in the direct addressing mode with a total of 512 pages, each page containing 
128 words. 


Up to eight auxiliary registers (ARO-AR7) provide flexible and powerful indirect addressing (five on the 
TMS32020, eight on the TMS320C25). To select a specific auxiliary register, the Auxiliary Register Pointer 
(ARP) is loaded with a value from O to 7 for ARO through AR7, respectively. 


There are seven types of indirect addressing: auto-increment or auto-decrement, post-indexing by either 
adding or subtracting the contents of ARO, single indirect addressing with no increment or decrement, 
and bit-reversal addressing (used in FFTs on the TMS320C25 only) with increment or decrement. All 
operations are performed on the current auxiliary register in the same cycle as the original instruction, 
following which the current auxiliary register and ARP may be modified. 


repeat feature 


A repeat feature, used with instructions such as multiply/accumulates, block moves, I/O transfers, and 
table read/writes, allows a single instruction to be performed up to 256 times. The repeat counter (RPTC) 
is loaded with either a data memory value (RPT instruction) or an immediate value (RPTK instruction). The 
value of this operand is one less than the number of times that the next instruction is executed. Those 
instructions that are normally multicycle are pipelined when using the repeat feature, and effectively become 
single-cycle instructions. 


instruction set summary 


Table 2 lists the symbols and abbreviations used in Table 3, the TMS320C25 instruction set summary. 
Table 3 consists primarily of single-cycle, single-word instructions. Infrequently used branch, I/O, and CALL 
instructions are multicycle. The instruction set summary is arranged according to function and alphabetized 
within each functional grouping. The symbol (1) indicates those instructions that are not included in the 
TMS32010 instruction set. The symbol (*) indicates instructions that are not included in the TMS32020 
instruction set. 


TABLE 2. INSTRUCTION SYMBOLS 


sympot_ | MEANING 


4-bit field specifying a bit code 

2-bit field specifying compare mode 

Data memory address field 

Format status bit 

Addressing mode bit 

Immediate operand field 

Port address (PAO through PA15 are predefined 
assembler symbols equal to O through 15, respectively.) 
2-bit field specifying P register output shift code 


3-bit operand field specifying auxiliary register 
4-bit left-shift code 
3-bit accumulator left-shift field 
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TABLE 3. TMS320C25 INSTRUCTION SET SUMMARY 
ACCUMULATOR MEMORY REFERENCE INSTRUCTIONS 


NO. INSTRUCTION BIT CODE 
WORDS 


MNEMONIC DESCRIPTION 


1514131211109 876543210 
Absolute value of accumulator 000110 1 
Add to accumulator with shift 
Add to accumulator with carry 
Add to high accumulator 
Add to accumulator short immediate 
Add to low accumulator with sign 
extension suppressed 
Add to accumulator with shift specified by 
T register 
Add to accumulator long immediate with shift 
AND with accumulator 
AND immediate with accumulator with shift 
Complement accumulator 
Load accumulator with shift 
Load accumulator immediate short 
Load accumulator with shift specified by T register 
Load accumulator long immediate with shift 
Negate accumulator 
Normalize contents of accumulator 
OR with accumulator 
OR immediate with accumulator with shift 
Rotate accumulator left 
Rotate accumulator right 
Store high accumulator with shift 
Store low accumulator with shift 
Subtract from accumulator long immediate with shift 
Shift accumulator left 
Shift accumulator right 
Subtract from accumulator with shift 
Subtract from accumulator with borrow 
Conditional subtract - 
Subtract from high accumulator 
Subtract from accumulator short immediate 


=e 48248 Seo oanpyDe ew oan See ap Se we ew ew DO ABW Dd 


on oooose=]=200 22202220 20-2402 
oooo o -cor0o0coor-ro0o0er00c0cqe0c0 +0 4 


Subtract from low accumulator with sign 
extension suppressed 

Subtract from accumulator with shift specified by 
T register 

Exclusive-OR with accumulator 

Exclusive-OR immediate with accumulator with shift 
Zero accumulator 

Zero low accumulator and load high accumulator 
Zero low accumulator and load high accumulator 
with rounding 

Zero accumulator and toad low accumulator with 


sign extension suppressed 


TThese instructions are not included in the TMS32010 instruction set. 
+These instructions are not included in the TMS$32020 instruction set. 
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TABLE 3. TMS320C25 INSTRUCTION SET SUMMARY (CONTINUED) 


AUXILIARY REGISTERS AND DATA PAGE POINTER INSTRUCTIONS 


MNEMONIC Oreenipiicn NO. INSTRUCTION BIT CODE 
WORDS 


1514131211109 876543210 
110 «—K— > 
1100101 0 0<CMP 
+—-R—> | <«<——D—> 
+R SK 
10110001 <«R—»> 
010% «—\—D— > 
00 <————DP—_> 
+R> 00000000 
1011 *_—\pD—> 
+R | +) 
117 ———_K———_> 


Add to auxiliary register short immediate 
Compare auxiliary register with auxiliary register ARO 


- Oo 


Pe ee ee er ery 


Load auxiliary register 

Load auxiliary register short immediate 

Load auxiliary register pointer 

Load data memory page pointer 

Load data memory page pointer immediate 
Load auxiliary register long immediate 

Modify auxiliary register 

Store auxiliary register 

Subtract from auxiliary register short immediate 


T REGISTER, P REGISTER, AND MULTIPLY INSTRUCTIONS 


eo-=-00=0 
~=90090000=0- 
---4c2 404 720-2054 


oO 


INSTRUCTION BIT CODE 
MNEMONIC DESCRIPTION 


(2) 


Add P register to accumulator 

Load high P register 

Load T register 

Load T register and accumulate previous product 
Load T register, accumulate previous product, 

and move data 

Load T register and store P register in accumulator 
Load T register and subtract previous product 
Multiply and accumulate 


oo oo = 


ooo 
—-- oO. 


Multiply and accumulate with data move 
Multiply (with T register, store product in P register} 
Multiply and accumulate previous product 
Multiply immediate 

Multiply and subtract previous product 
Multiply unsigned 

Load accumulator with P register 
Subtract P register from accumulator 
Store high P register 

Store low P register 

Set P register output shift mode 


ooo 2-720 


000 1 0<PMP» 
+\_ —_D——__> 
+\_—_)D-——__> 


Square and accumulate 


ee ee ee ee ee ee ee Se ee ee See ee eS ee sy 
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"-O444%n4 450 C0024 40 
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Square and subtract previous product 


TThese instructions are not included in the TMS32010 instruction set. 
+These instructions are not included in the TMS32020 instruction set. 
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TABLE 3. TMS320C25 INSTRUCTION SET SUMMARY (CONTINUED) 


BRANCH/CALL INSTRUCTIONS 


MNEMONIC SESGRIPTION NO. INSTRUCTION BIT CODE 
WORDS 


1614131211109 876543210 


2 


1 
1 1 
2 1 
2 1 
2 1 
2 ie) 
2 1 
2 1 
2 1 
2 1 
2 1 
2 fe) 
2 1 
2 1 
2 1 
2 1 
1 1 
2 1 
1 1 


ay 
= 
~ 


Branch unconditionally 

Branch to address specified by accumulator 
Branch on auxiliary register not zero 
Branch if TC bit + O 

Branch if TC bit = O 

Branch on carry 

Branch if accumulator = 0 

Branch if accumulator > 0 

Branch on I/O status = O 

Branch if accumulator 

Branch if accumutator 

Branch on no carry 

Branch if no overflow 

Branch if accumulator 

Branch on overflow 

Branch if accumulator 

Call subroutine indirect 

Call subroutine 

Return from subroutine 


OFrFO-2n 240 2424 44 O02 a a2 OH 
CfA 024454246464 4 anni ncn o 
mp -420000-0020 042444454 
--52+20+4- 000024200022 
~se+42 0024244420020 02 4 
eoeoooOH---=-4-00-c00=--0 


; VO AND DATA MEMORY OPERATIONS 


1514131211109 876543210 


N 


Block move from data memory to data memory 
Block move from program memory to data memory 
Data move in data memory 

Format serial port registers 

Input data from port 

Output data to port 

Reset serial port frame synchronization mode 
Reset serial port transmit mode 

Reset external flag 

Set serial port frame synchronization mode 

Set serial port transmit mode 

Set external flag 

Table read 

Table write 


es es ee es se ee 
COfn fee ann w ningun 
—- 2s SV Be eB eS Yt SB Oe as a = 
eo0oeoooO0oOOoOHA0c0OOAa B= 
--cococ0000 0002244 
oa as “~O-- a 
COnn2a4en nr PP LLoe 
Se Oreo ey Boros 
--o000000--o0-- 


Tt These instructions are not included in the TMS32010 instruction set. 
+ These instructions are not included in the TMS32020 instruction set. 
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TABLE 3. TMS320C25 INSTRUCTION SET SUMMARY (CONCLUDED) 


CONTROL INSTRUCTIONS 


1614131211109 876543210 


Test bit 

Test bit specified by T register 
Configure block as data memory 
Configure block as program memory 
Disable interrupt 

Enable interrupt 

(dle until interrupt 

Load status register STO 

Load status register ST1 

No operation 

Pop top of stack to low accumulator 
Pop top of stack to data memory 
Push data memory value onto stack 
Push low accumulator onto stack 
Reset carry bit 

Reset hold mode 

Reset overflow mode 

Repeat Instruction as specified by data memory value 
Repeat instruction as specified by immediate value 
Reset sign-extension mode 

Reset test/control flag 

Set carry bit 

Set hold mode 

Set overflow mode 

Store status register STO 

Store status register ST1 

Set sign-extension mode 

Set test/control flag 

Software interrupt 


001 +B» 
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| 
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T These instructions are not included in the TMS32010 instruction set. 
+ These instructions are not included in the TMS32020 instruction set. 
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DEVICES 


development support 


Texas Instruments offers an extensive line of development support products to assist the user in all aspects 
of TMS320 second-generation-based design and development. These products range from development 
and application software to complete hardware development and evaluation systems such as the XDS/22. 
Table 4 lists the development support products for the second-generation TMS320 devices. 


System development begins with the use of the SoftWare Development System (SWDS) or Emulator (XDS). 
These tools allow the designer to evaluate the processor’s performance, benchmark time-critical code, 
and determine the feasibility of using a TMS320 device to implement a specific algorithm. 


Software and hardware can be developed in parallel by using the macro assembler/linker, simulator, and 
SoftWare Development System for software development and the XDS for hardware development. The 
assembler/linker translates the system’s assembly source program into an object module that can be 
executed by the simulator, XDS, or SWDS. The XDS provides realtime in-circuit emulation and is a powerful 
tool for debugging and integrating software and hardware modules. 


Additional support for the TMS320 products consists of extensive documentation and three-day DSP design 
workshops offered by the Tl Regional Technology Centers (RTCs). The workshops provide hands-on 
experience with the TMS320 development tools. Refer to the 7V@S320 Family Development Support 
Reference Guide for further information about TMS320 development support products and DSP workshops. 
When technical questions arise regarding the TMS320, contact the Texas Instruments TMS320 Hotline, 
(713) 274-2320. 


TABLE 4. TMS320 SECOND-GENERATION SOFTWARE AND HARDWARE SUPPORT 


SOFTWARE TOOLS PART NUMBER 


Macro Assembler/Linker 
VAX VMS T™MDS3242210-08 
TI/IBM MS/PC-DOS TMDS3242810-02 


Simulator 
VAX VMS TMDS3242211-08 
TVIBM MS/PC-DO0S TMDS3242811-02 


SoftWare Development System (SWDS) TMDS3268821 


Digital Filter Design Package (DFDP) 
IBM PC-DOS DFDP-IBM002 


DSP Software Library 
VAX VMS TMDC3240212-18 
TVIBM MS/PC-DOS TMDC3240812-12 


HARDWARE TOOLS PART NUMBER 


Analog Interface Board (AIB) RTC/EVM320C-06 
XDS/22 Emulator TMDS3262221 


XDS/22 Upgrade 
Customer Upgrade TMDS3282226 


TMS320DDK 
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documentation support 


Extensive documentation supports the second-generation TMS320 devices from product announcement 
through applications development. The types of documentation include data sheets with design 
specifications, complete user’s guides, and 750 pages of application reports published in the book, Digita/ 
Signal Processing Applications with the TMS320 Family. An application report, Hardware Interfacing to 
the TMS320C25, is available for that device. 


A series of DSP textbooks is being published by Prentice-Hall and John Wiley & Sons to support digital 
signal processing research and education. The TMS320 newsletter, Details on Signal Processing, is published 
quarterly and distributed to update TMS320 customers on product information. The TMS320 DSP bulletin 
board service provides access to large amounts of information pertaining to the TMS320 family. 


Refer to the 7MS320 Family Development Support Reference Guide for further information about TMS320 
documentation. To receive copies of second-generation TMS320 literature, call the Customer Response 
Center at 1-800-232-3200. 
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TMS32020 


Gee 
absolute maximum ratings over specified temperature range (unless otherwise noted)? 


Supply voltage range, VCC? 2.0... ce ce cee eet e eee tence ee enee -O0.3Vto7V 
Input Voltage range << c.6 cee i eis been ws Sle ee ow es Se Eo be Malek eae pe He Re -0.3Vto7V 
Output voltage range... 1... eee eee een eens -O.3Vto7V 
Continuous power dissipation ... 2.0... 0... 0. cee ee eee eee ee eee 2.0 W 
Operating free-air temperature range .... 2... et eee eee 0°C to 70°C 
Storage temperature range ... 20... 6. cc ee eee tenes -—55°C to 150°C 


TStresses beyond those listed under ‘‘Absolute Maximum Ratings’’ may cause permanent damage to the device. This is a stress rating 
only and functional operation of the device at these or any other conditions beyond those indicated in the ‘‘Recommended Operating 
Conditions’’ section of this specification is not implied. Exposure to absolute-maximum-rated conditions for extended periods may affect 
device reliability. 

tAll voltage values are with respect to Vss. 


recommended operating conditions 


Vcc Supply voltage 4.75 5 5.25 
Vss_ Supply voltage eT Rae Aa 
All inputs except CLKIN 2 Vec +03 
sViie: Hightievss itage [Allinputs except CLKIN, 
He: SBD NDOR veNeRe, CLKIN 2.4 Vec+03 
All inputs except CLKIN -0.3 0.8 
0 ; 


Ta Operating free-air temperature (see Notes 1 and 2) 


NOTES: 1. Case temperature (Tc) must be maintained below 90°C. 
2. Raja = 36°C/Watt, Rejc = 6°C/Watt. 


electrical characteristics over specified free-air temperature range (unless otherwise noted) 


| Vou Fighievel output voltage | Vec=MIN.lon= MAX —SSCSC~C~i aS 
Vou towdevel output voltage | _Vec = MIN.ign = MAX —SSCS~—~—SC‘“‘ SO 
[iz Threestete curent | __Vec=MAXSOSCS—CSSCSCSCS A OSCCSSC« 
Pi__Input current ___—+|[ Ss = Vegto Veg SSCS TO SCSC~«SO 
C—O 
ee 
a 
| 


Ta = 0°C, Vec = MAX, fy = MAX 

Icc Supply current Ta = 25°C, Vcc = MAX, fy = MAX 
: 
Por _laputcapacitence i SCSC—“CSCSCSCSCSCTCC‘d 
[Co Output capacitance. | —SSSSSCSSCSCSSSCCSCSSSSC—CSY 


TAII typical values are at Vcc = 5 V, Ta = 25°C. 


AN Caution. This device contains circuits to protect its inputs and outputs against damage due to high static voltages or electrostatic 
Abita fields. These circuits have been qualified to protect this device against electrostatic discharges (ESD) of up to 2 kV according 
to MIL-STD-883C, Method 3015; however, it is advised that precautions be taken to avoid application of any voltage higher than maximum 
rated voltages to these high-impedance circuits. During storage or handling, the device leads should be shorted together or the device 
should be placed in conductive foam. In a circuit, unused inputs should always be connected to an appropriate logic voltage level, preferably 
either Vcc or ground. Specific guidelines for handling devices of this type are contained in the publication ‘Guidelines for Handling Electrostatic- 
Discharge Sensitive (ESDS) Devices and Assemblies’’ available from Texas Instruments. 
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CLOCK CHARACTERISTICS AND TIMING 


The TMS32020 can use either its internal oscillator or an external frequency source for a clock. 


internal clock option 


The internal oscillator is enabled by connecting a crystal across X1 and X2/CLKIN (see Figure 2). The 
frequency of CLKOUT1 is one-fourth the crystal fundamental frequency. The crystal should be fundamental 
mode, and parallel resonant, with an effective series resistance of 30 ohms, a power dissipation of 1 mW, 
and be specified at a load capacitance of 20 pF. 


fy Input clock frequency A = 0°C to 70°C MHz 
fsx Serial port frequency Ta = 0°C to 70°C 2563 kHz 


PARAMETER TEST CONDITIONS MIN TYP MAX | UNIT 
T 6.7 
50 


CRYSTAL 
“7 (i 


FIGURE 2. INTERNAL CLOCK OPTION 


external clock option 


An external frequency source can be used by injecting the frequency directly into X2/CLKIN with X11 left 
unconnected. The external frequency injected must conform to the specifications listed in the following table. 


switching characteristics over recommended operating conditions (see Note 3) 


twiCL) CLKOUT1/CLKOUT2 low pulse duration 20-15 2Q 20+15 
twiCH) CLKOUT1/CLKOUT2 high pulse duration 2Q-15 2Q 20+15 


ta(C1-C2) CLKOUT1 high to CLKOUT2 low, CLKOUT2 high to CLKOQUT1 high, etc. 


NOTE 3: Q = 1/4tg(c). 
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timing requirements over recommended operating conditions (see Note 3) 


te(cl) CLKIN cycle time 
tHich CLKIN fall time 
tech CLKIN rise time 


twiCIL) CLKIN low pulse duration, tc(c) = 50 ns {see Note 4) 
twiCIH} CLKIN high pulse duration, tc(cl) = 50 ns (see Note 4) 
tsu(s) | SYNC setup time before CLKIN low 

this) SYNC hold time from CLKIN low 


NOTES: 3. O = 1/4tg(¢). 
4. CLKIN duty cycle [tp@ci) + tw(CtH)]/tc(cl) must be within 40-60%. 


2.15 V 


R_ = 8252 


FROM OUTPUT ; 
UNDER TEST TEST 
POINT 


T Cy = 100 pF 


FIGURE 3. TEST LOAD CIRCUIT 


2.0V. 
0.92 V— —_——<«— 2 a= am 
Gages =—SSSS5 Vit (MAX) 


(a) INPUT 


2.4V— 

— ee oY, MIN 
2.2Ve = OH (MIN) 
0.8 V™ 
Seve = SS SS Voi (MAX) 


(b) OUTPUTS 


FIGURE 4. VOLTAGE REFERENCE LEVELS 
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MEMORY AND PERIPHERAL INTERFACE TIMING 


switching characteristics over recommended operating conditions (see Note 3) 


PARAMETER MIN TYP MAX 
tg(c1-s) STRB from CLKOUT1 (if STRB is present) Q-15 Q Q+15 


ta(C2-S) CLKOUT2 to STRB {if STRB is present) 


tsu(A) Address setup time before STRB tow (see Note 5) 
thiA) Address hold time after STRB high (see Note 5) 


twi(SH) STRB high pulse duration (between consecutive cycles, see Note 6) 


taimsc) MSC valid from CLKOUT1 


twi(SL) STRB low pulse duration (no wait states, see Note 6) hae ae 


NOTES: 3. Q = 1/4tg(c), 
5. A15-A0, PS, BS, IS, R/W, and BR timings are all included in timings referenced as ‘‘address."’ 
6. Delays between CLKOUT1/CLKOUT2 edges and STRB edges track each other, resulting in tw(SL) and tw(SH) being 20 
with no wait states. 


timing requirements over recommended operating conditions (see Note 3) 


a 
30-70 

tsu(D)R _Data read setup time before STRB high | 40s 

| thipjr _Dataread hold time from STAB high | 

|_tgisi-ny READY valid after STRE low (no waitstates) | | 

| tgic2H-fy READY valid after CLKOUTZ high 0 | 

| thisi-n) READY hold time after STAB low (no waitstates) | OB 

Gees Se 

aS EY 

ee Se 


STR 
thiC2H-R) READY hold after CLKOUT2 high 
tgiM-R READY valid after MSC valid 
th(M-R READY hold time after MSC valid 


NOTES: 3. Q = 1/4tg(c), 
5. A15-A0, PS, DS, IS, R/W, and BR timings are all included in timings referenced as ‘‘address.”’ 
7. Read data access time is defined as taia) = tsu(A) + tw(SL) — tsu(D)R- 
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RS, INT, BIO, AND XF TIMING 


switching characteristics over recommended operating conditions (see Note 3) 
PARAMETER 


tdiRS) CLKOUT1 low to reset state entered 


td(WACK) CLKOUT1 to IACK valid -25 fe) 25 


td(XF) XF valid before falling edge of STRB 


NOTES: 3. Q = 1/4tgic). 
8. RS, INT, and BIO are asynchronous inputs and can occur at any time during a clock cycle. However, if the specified setup 
time is met, the exact sequence shown in the timing diagrams will occur. 


timing requirements over recommended operating conditions (see Note 3) 


tsu(IN) INT/BIO/RS setup before CLKOUT1 high 


R | soi 
thiIN) INT/BIO/RS hold after CLKOUT1 high = ae 


tw(IN) INT/BIO tow pulse duration tc(C) 
twiRS) RS low pulse duration ; 3teic) 
NOTES: 3. Q = 1/4tg(c). 


8. RS, INT, and BIO are asynchronous inputs and can occur at any time during a clock cycle. However, if the specified setup 
time is met, the exact sequence shown in the timing diagrams will occur. 


HOLD TIMING 


switching characteristics over recommended operating conditions (see Note 3) 


PARAMETER MIN TYP 
tdiC1L-AL) | HOLDA low after CLKOUT1 low -25 
tdis(AL-A) HOLDA low to address three-state 


tdis(C1L-A) Address three-state after CLKOUT1 low (HOLD mode, see Note 9) 
td{HH-AH) HOLD high to HOLDA high 


ten(A-C1L) | Address driven before CLKOUT1 low (HOLD mode, see Note 9) 


NOTES: 3. Q = 1/4tg(c). 
9. A15-A0, PS, DS, 1S, STRB, and R/W timings are all included in timings referenced as ‘‘address."’ 


timing requirements over recommended operating conditions (see Note 3) 


td(C2H-H) HOLD valid after CLKOUT2 high P45 {ns 


NOTE 3: Q = 1/4te(c). 
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SERIAL PORT TIMING 


switching characteristics over recommended operating conditions (see Note 3) 


tdiFL-Dx) 0X valid after FSX falling edge (TXM = 0, see Note 10) 
td(CH-FS) FSX valid after CLKX rising edge (TXM = 1) 


NOTES: 3. Q = 1/4tg(¢). 
10. The last occurrence of FSX falling and CLKX rising. 


timing requirements over recommended operating conditions (see Note 3) 


ee Se IN 
Fusscx) —_ Sevial port clock (CLKX/CLKRIfaltime ——SSOSCSC~—SCS~—SsSC~sSSCSC“‘*‘“‘“~*~*sO*dSC—* 
Fiscx) Serial port clock (CLKX/CLKAI rise time ———S—SCSC~CS~SCSCSSSCSCS~C~CS~S~S 
FewiSck) Serial port clock (CLKX/CLKR low pulse duration (see Note 7) ——~—=«d?SCBO~« 000 | ne 
tw(SCK) Serial port clock (CLKX/CLKR) high pulse duration (see Note 11) | 150 ~~ 12,000 | ns | 
Ee 
[ns 
| ns | 
| ns 


tsu(FS) __FSX/FSR setup time before CLKX/CLKR falling edge (TXM = 0) 
thiFs) __ FSX/FSR hold time after CLKX/CLKR falling edge (TXM = 0) 
tsu(DR) _OR setup time before CLKR falling edge i a0. ee Se 


DR hold time after CLKR falling edge 


thiDR) 


NOTES: 3. Q = 1/4tgi¢). 
11. The duty cycle of the serial port clock must be within 40-60%. 
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absolute maximum ratings over specified temperature range (unless otherwise noted) t 


Supply voltage ange, VEC? cis vistet dee ey tee telat See epee BEAN ESG -O.3Vto7V 
Input voltagea:range: 32.3 oe Sle hee See Wat ea ds Gaeta Beh eels Ss -O.3Vto7V 
Output voltage range: i. i.%: Sie. Me es ates Bloke WE EEN Go eS Sa Alaa aA oan -O.3Vto7V 
Continuous power dissipation .. 0... 0.0 ee eee ee teen eens 1.5W 
Operating free-air temperature range .. 1... ee eens 0°C to 70°C 
Storage temperature range ... 1... . ee eee eens ~55°C to 150°C 


tStresses beyond those listed under ‘‘Absolute Maximum Ratings’’ may cause permanent damage to the device. This is a stress rating 
only and functional operation of the device at these or any other conditions beyond those indicated in the ‘‘Recommended Operating 
Conditions’ section of this specification is not implied. Exposure to absolute-maximum-rated conditions for extended periods may affect 
device reliability. 

tall voltage values are with respect to Vss. 


recommended operating conditions 


Vcc Supply voltage 4.75 5 5,25 
Vss Supply voltage Kew = Oe | 


3 


Vi High-level input voltage 


2.5 
ae ee ee 
lou High-level output current F800 fA 


lo, low-level output current 
Ta Operating free-air temperature 


electrical characteristics over specified free-air temperature range (unless otherwise noted) 


PARAMETER TEST CONDITIONS MIN TYPt MAX | UNIT 


Von High tevet output voltage | Veg = WIN. low = MAX SSCS CV 
Vou _Lowlevel output veltage | Vcc = MIN ig. = MAX —SSSCSC~C~—~iSC“‘COWS COO OV 
Pig Thvee-state curent [Veg = AK SSCS SSSA 
Pir Inputcurent_ SiS = Veg tec SCSSCSC~i SSCS 
[Po ae aay 
e|_tnput capacitance «| SSS id 
[eG Output capacitance | SSS 


TAll typical values are at Vcc = 5 V, Ta = 25°C. 


«& Caution. This device contains circuits to protect its inputs and outputs against damage due to high static voltages or electrostatic 
Aia® fields. These circuits have been qualified to protect this device against electrostatic discharges (ESD) of up to 2 kV according 
to MIL-STD-883C, Method 3015; however, it is advised that precautions be taken to avoid application of any voltage higher than maximum 
rated voltages to these high-impedance circuits. During storage or handling, the device leads should be shorted together or the device 
should be placed in conductive foam. In a circuit, unused inputs should always be connected to an appropriate logic voltage level, preferably 
either Vcc or ground. Specific guidelines for handling devices of this type are contained in the publication ‘Guidelines for Handling Electrostatic- 
Discharge Sensitive (ESDS) Devices and Assemblies’’ available from Texas Instruments. 
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CLOCK CHARACTERISTICS AND TIMING 


The TMS320C25 can use either its internal oscillator or an external frequency source for a clock. 
internal clock option 


The internal oscillator is enabled by connecting a crystal across X1 and X2/CLKIN (see Figure 2). The 
frequency of CLKOUT1 is one-fourth the crystal fundamental frequency. The crystal should be either 
fundamental or overtone mode, and parallel resonant, with an effective series resistance of 30 ohms, a 
power dissipation of 1 mW, and be specified at a load capacitance of 20 pF. Note that overtone crystals 
require an additional tuned LC circuit (see the application report, Hardware Interfacing to the TMS320C25). 


pened CSSOS~St tw OG SSCSC~<‘dSSCt Od 


CRYSTAL 


c1 u C2 
T T 


FIGURE 2. INTERNAL CLOCK OPTION 
external clock option 


An external frequency source can be used by injecting the frequency directly into X2/CLKIN with X11 left 
unconnected. The external frequency injected must conform to the specifications listed in the following table. 


switching characteristics over recommended operating conditions (see Note 3) 


PARAMETER MIN TYP MAX UNIT 
tc(C) CLKOUT1/CLKOUT2 cycle time 97. 597 


7 | ns | 
raicrcg) CLAN high to CLKOUTH/GLKOUTARTAB Highiow ——SSCSC~S~iCiSS8 |e | 
Fig) CIKOUTHIGLKOUTZETABfalime ———SSSSCSC~“‘“S*S*~“‘“dSC‘“‘“‘(C‘C'SSS*S nw 
Fig) CLKOUTH/GLKOUT2ISTRE ise time ———~SSCSCSCSC~S~CSCSCSCSC~i oe 

i | ns | 
| ns | 


tw(CL) CLKOUT1/CLKOUT2 low pulse duration 20-8 2Q 20+8 
twiCH) CLKOUT1/CLKOUTZ2 high pulse duration 20-8 2Q 20+8 


[@) 


td(C1-C2) CLKOUT? high to CLKOUT2 low, CLKOUT2 high to CLKOUT1 high, etc. -5 Q Q+5 


NOTE 3:Q = 1/4t,(c). 
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timing requirements over recommended operating conditions (see Note 3) 


te(ch) CLKIN cycle time 
tect) CLKIN fall time 
trich CLKIN rise time 


twiciL) CLKIN low pulse duration, tcic1) = 50 ns (see Note 4) 
twi(CIH) CLKIN high pulse duration, tgicj) = 50 ns (see Note 4) 
tsu(s) SYNC setup time before CKLIN low 

this) SYNC hold time from CLKIN low 


NOTES: 3. Q = 1/4tgic). 
4. CLKIN duty cycle [tr(cl) + tw(CIH)]/te(cl) Must be within 40-60%. 


2.15 V 


R, = 8252 


FROM OUTPUT 
UNDER TEST TEST 
POINT 


aT CL = 100 pF 


FIGURE 3. TEST LOAD CIRCUIT 


1.88 V- 1H 
0.92 V— a 
0.80 v= ——==> = = Vit (MAX) 


(a) INPUT 


= — = Vox (MIN) 


o6v- ii Voi (MAX) 


(b) OUTPUTS 


FIGURE 4. VOLTAGE REFERENCE LEVELS 
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MEMORY AND PERIPHERAL INTERFACE TIMING 


switching characteristics over recommended operating conditions (see Note 3) 


PARAMETER MIN TYP MAX 
tg(c1-s) STRB from CLKOUT1 (if STRB is present) Q-6 Q a+6 


NOTES: 3.Q = 1/4tg(¢). 
5. A15-A0, PS, DS, IS, R/W, and BR timings are all included in timings referenced as ‘‘address.”’ 
6. Delays between CLKOUT1/CLKOUT2 edges and STAB edges track each other, resulting in twst) and ty(SH) being 20 with 
no wait states. 


timing requirements over recommended operating conditions (see Note 3) 


ta(A Read data access time from address time (read cycle, see Notes 5 and 7) 
tsu(D)R _ Data read setup time before STAB high ) 2300 
thiD)R Data read hold time from STRB high 


36-56 | me | 
Ftyistsny READY valid after STAB ow (no waitstates) ———SSSSSCSCS~C~‘~‘iSSCSC*C*‘“‘“ MO 
Fta(CaH.A) READY valid after CLKOUTZ igh | 20 || 

ns _| 


th(SL-R) READY hold time after STRB low (no wait states) Q+3 
th(C2H-R) READY hold after CLKOUT2 high 


td(M-R) _ READY valid after MSC valid | 20-25 | 
thiM-R) READY hold time after MSC valid ze 


NOTES: 3. Q = 1/4tgic). 
5. A15-A0, PS, BS, 1S, R/W, and BR timings are all included in timings referenced as “‘address.”’ 
7. Read data access time is defined as ta(A) = tsu(A) + twi(SL) — tsu(D)R- 
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RS, INT, BIO, and XF TIMING 


switching characteristics over recommended operating conditions (see Note 3) 


td(XxF XF valid before falling edge of STRB 
NOTES: 3. Q = 1/4tg(c). 


8. RS, INT, and BIO are asynchronous inputs and can occur at any time during a clock cycle. However, if the specified setup 
time is met, the exact sequence shown in the timing diagrams will occur. 


timing requirements over recommended operating conditions (see Note 3) 


tsu(IN) __INT/BIO/RS setup before CLKOUT1 high Es 
th(IN) INT/BIO/RS hold after CLKOUT1 high a ee eer 
ee ae 


teIN) fall time 


tw(IN) INT/BIO tow pulse duration tcic) 


twins) __RS low pulse duration 


NOTES: 3. Q = 1/4tg(c). 
8. RS, INT, and BIO are asynchronous inputs and can occur at any time during a clock cycle. However, if the specified setup time 
is met, the exact sequence shown in the timing diagrams will occur. 


HOLD TIMING 


switching characteristics over recommended operating conditions (see Note 3) 


PARAMETER 


tdiC1L-AL) HOLDA tow after CLKOUT1 tow 


tdis(AL-A) HOLDA ow to address three-state Po ee Gy = 
tdis(C1L-A) Address three-state after CLKOUT1 low (HOLD mode, see Note 9) hae | 


td(HH-AH) HOLD high to HOLDA high 
ten(A-C1L) Address driven before CLKOUT1 low (HOLD mode, see Note 9) 


NOTES: 3. Q = 1/4tg(c). 
9. A15-A0, PS, DS, iS, STRB, and R/W timings are all included in timings referenced as ‘‘address."’ 


timing requirements over recommended operating conditions (see Note 3) 
td(C2H-H) HOLD valid after CLKOUT2 high 


NOTE 3: Q = 1/4tgic). 
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SERIAL PORT TIMING 


switching characteristics over recommended operating conditions (see Note 3) 


PARAMETER MIN TYP MAX UNIT 
td(CH-Dx) _DX valid after CLKX rising edge (see Note 10) 


ta(FL-Dx) OX valid after FSX falling edge (TXM = O, see Note 10) 
ta(CH-FS) FSX valid after CLKX rising edge (TXM = 1) 


NOTES: 3. Q = 1/4tg¢). 
10. The last occurrence of FSX falling and CLKX rising. 


timing requirements over recommended operating conditions (see Note 3) 


tce(SCk) . Serial port clock (CLKX/CLKR) cycle time 
t#(SCK) Serial port clock (CLKX/CLKR) fall time 
tr(SCK) Serial port clock (CLKX/CLKR) rise time 


twi(SCK) Seria! port clock (CLKX/CLKR) low pulse duration (see Note 11) 


tw(SCK) Serial port clock (CLKX/CLKR) high pulse duration (see Note 11) 
tsu(FS) FSX/FSR setup time before CLKX/CLKR falling edge (TXM = 0) 


thiFS) FSX/FSR hold time after CLKX/CLKR falling edge (TXM = O) 


tsu(DR) OR setup time before CLKR falling edge 


th(DR) DR hold time after CLKR falling edge 


NOTES: 3. Q = 1/4tgc). 
11. The duty cycle of the seriat port clock must be within 40-60%. 
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TIMING DIAGRAMS 
This section contains all the timing diagrams for the TMS320 second-generation devices. Refer to the top corner 
for the specific device. 
Timing measurements are referenced to and from a low voltage of 0.8 volts and a high voltage of 2.0 volts, 
unless otherwise noted. 
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memory read timing 
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memory write timing 
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one wait-state memory access timing 
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reset timing 
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SERIAL PORT 
CONTROLS* 


tControl signals are DS, 1S, R/W, and XF. 
tSerial port controls are DX and FSX. 
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interrupt timing (TMS32020) 
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interrupt timing (TMS320C25) 
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BIO timing 
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external flag timing 
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HOLD timing (part A) 
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THOLD is an asynchronous input and can occur at any time during a clock cycle. If the specified timing is met, the exact sequence shown 
will occur; otherwise, a delay of one CLKOUT2 cycle will occur. 
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HOLD timing (part B) 
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THOLD is an asynchronous input and can occur at any time during a clock cycle. If the specified timing is met, the exact sequence shown 
will occur; otherwise, a delay of one CLKOUT2 cycle will occur. 
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HOLD timing (part A) 


crnenn he Nay Ne” 
1 
| 
I 
CLKOUT2 { 
l 


—wl le tic it-aL) 


FETCH 


N - 2 - N = 
EXECUTE eu 


THOLD is an asynchronous input and can occur at any time during a clock cycle. {f the specified timing is met, the exact sequence shown 
will occur; otherwise, a delay of one CLKOUT2 cycle will occur. 
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HOLD timing (part B) 
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THOLD is an asynchronous input and can occur at any time during a clock cycle. If the specified timing is met, the exact sequence shown 
will occur; otherwise, a delay of one CLKOUT2 cycle will occur. 
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serial port receive timing 
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serial port transmit timing 
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MECHANICAL DATA 
68-pin GB grid array ceramic package 


TOP VIEW 


28,448 (1.120) 
27,432 (1.080) 
17,02 (0.670) 

| NOM | 


THERMAL RESISTANCE CHARACTERISTICS 
28,448 (1.120) 
27,432 (1.080) 
17,02 (0.670) 
NOM 
R Junction-to-case 
6JC thermal resistance 


4,953 (0.195) 
2,032 (0.080) agrees 
+ MAX 
ie 
3,302 (0.130) 0,508 (0.020)_,| 1,575 (0.062) pia 
2,794 (0.110) 0,406 (0.016) 1,473 (0.058) 
2,54 (0.100 
TP. | BOTTOM VIEW 
L 
: 2,54 (0.100) 
J ; TP. 
H 
G 
F 
E 
D 
c shisha 
NOM 
|©OOO00000GO(t shits 
AL ©O©OCOCOGC66—+ 


1 2 4 5 6 9 10 11. 6_1.27 (0.050) 
3 7 8 "1 roe 


ALL LINEAR DIMENSIONS ARE IN MILLIMETERS AND PARENTHETICALLY IN INCHES 
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68-pin plastic leaded chip carrier package (TMS320C25 only) 


4,50 (0.177) 
4,24 (0.167) 
2.79 (0.110) 
2,41 (0.095) 


1,35 (0.053) x 45° 0,25 (0.010) R MAX 
1,19 (0.047) \ IN 3 PLACES 


1,27 (0.050) T.P. 


(SEE NOTE B) 
23,62 (0.930) 25,27 (0.995) 
23,11 (0.910) 25,02 (0.985) 


(AT SEATING PLANE) 
24,33 (0.956) 


24,13 (0.950) 
(SEE NOTE A) 


| 24,33 (0.956) (SEE NOTE A) 


24,13 (0.950) 


25,27 (0.995) 
25,02 (0.985) 


0,94 (0.037) 


0.69 (0.027) * 


1,22 (0.048) , aco 
1.07 (0.042) 


I 
SEATING PLANE 


. a 0,81 (0.032) THERMAL RESISTANCE CHARACTERISTICS 


0,66 (0.026) 
| PaRAmereR | max | untr | 
1,52 (0.060) MIN Junction-to-free-air s 
Ti Foun tnomatetnen | “2 | 2 | 
Junction-to-case 
TE Rac thermal! resistance orn | eciw | 
| 0,64 (0.025) MIN 


0,51 (0.020) 
0,36 (0.014) 


LEAD DETAIL 


NOTES: A. Centerline of center pin each side is within 0,10 (0.004) of package centerline as determined by this dimension. 
B. Location of each pin is within 0,127 (0.005) of true position with respect to center pin on each side. 


ALL LINEAR DIMENSIONS ARE IN MILLIMETERS AND PARENTHETICALLY IN INCHES. 
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OCTOBER 1987 


S Temperature ... —55°C to 100°C 68-PIN FJ PACKAGE 
200-ns Instruction Cycle Time CERNE EES DED CO noen 
(TOP VIEW) 


544 Words of Programmable On-Chip Data 
RAM 


128K Words of Data/Program Space 
Sixteen Input and Sixteen Output Channels 6 
16-Bit Parallel Interface 


Directly Accessible External Data Memory 
Space 


Global Data Memory Interface 
16-Bit Instruction and Data Words 
32-Bit ALU and Accumulator 


Single-Cycle Multiply/Accumulate 
Instructions 


O to 16-Bit Scaling Shifter 
Bit Manipulation and Logical Instructions 


Instruction Set Support for Floating-Point 
Operations 


Block Moves for Data/Program Management 


Repeat Instructions for Efficient Use of 
Program Space 


Five Auxiliary Registers and Dedicated 
Arithmetic Unit for Indirect Addressing 68-PIN GB 


PIN GRID ARRAY CERAMIC PACKAGEt 
Serial Port for Direct Codec Interface 


(TOP VIEW) 
Synchronization Input for Synchronous 
Multiprocessor Configurations 
Wait States for Communication to Slow 
Off-Chip Memories/Peripherals 


12 3 4 5 6 7 8 9 10 11 


On-Chip Timer for Control Operations 
Packaging: 

— 68 Pin Ceramic Leaded Chip Carrier 
— 68 Pin Grid Array 


Three External Maskable User Interrupts 
Input Pin Polled by Software Branch 
Instruction 

Programmable Output Pin for Signaling 
External Devices 

2.4-Micron NMOS Technology 

Single 5-V Supply 

On-Chip Clock Generator 


Standard and Class B Processing 


SM Prefix —Standard 
SMJ Prefix—Class B TSee Pin Assignments Table (Page 2) and Pin Nomenclature 
Table (Page 3) for location and description of all pins. 
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description 


The SMJ32020 Digita! Signal Processor is a second-generation member of the SMJ320 group of military 
VLSI digital signal processors and peripherals. The SMJ32020 supports a wide range of digital signal 
processing applications, such as tactical communications, guidance, military modems, sonar, signal 
processing/Al, image processing, speech processing, spectrum analysis, audio processing, digital filtering, 
high-speed control, graphics, and other computation-intensive applications. 


With a 200-ns instruction cycle time and an innovative memory configuration, the SMJ32020 performs 
operations necessary for many realtime digital signal processing algorithms. Since most instructions require 
only one cycle, the SMJ32020 is capable of executing five million instructions per second. On-chip data 
RAM of 544 16-bit words, direct addressing of up to 64K words of external data memory space and 64K 
words of external program memory space, and multiprocessor interface features for sharing memory 
minimize unnecessary data transfers to take full advantage of the capabilities of the processor. 


PGA/CLCC PIN ASSIGNMENTS 


FUNCTION FUNCTION FUNCTION FUNCTION FUNCTION FUNCTION 


K1/26 | A12 K8/40 H1/22 H2/23 
K2/28 | A13 L9/41 1s 311/46 L6/35 
L3/29 | A14 K9/42 AG/1 B1/10 
K3/30 | A15 L10/43 MSC C10/59 K11/44 
L4/31 | BIO B7/68 PS J10/47 L2/27 


K4/32 | BR ~ G11/50 B8/66 011/56 
L5/33 | CLKOUT1 C11/58 RS A8/65 G10/51 
K5/34 | CLKOUT2 010/57 Ww H11/48] X2/CLKIN F11/52 
K6/36 | CLKR B9/64 H10/49 

L7/37 | CLKX A9/63 F2/19 

K7/38 | DO F1/18 A10/61 

18/39 } 01 E2/17 B10/62 
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PIN NOMENCLATURE 


SIGNALS vo/zt DEFINITION 


5-V supply pins 

Ground pins 

Output from internal oscillator for crystal 

Input to internal oscillator from crystal or external clock 

Master clock output (crystal or CLKIN frequency/4) 

A second clock output signal 

16-bit data bus 015 (MSB) through DO (LSB). Multiplexed between program, data, and I/O spaces. 
16-bit address bus A15 (MSB) through AO (LSB) 

Program, data, and I/O space select signals 

Read/write signal 

Strobe signal 

Reset input 

External user interrupt inputs 

Microstate complete signal 

Interrupt acknowledge signal 

Data ready input. Asserted by external logic when using slower devices to indicate that the current bus 
transaction is complete. 

Bus request signal. Asserted when the SMJ32020 requires access to an external global data memory 
space. 

External flag output (latched software-programmable signal) 

Hold input. When asserted, SMJ32020 goes into an idle mode and places the data, address, and 
control lines in the high impedance state. 

Hold acknowledge signal 

Synchronization input 

Branch control input. Polled by BIOZ instruction. 

Serial data receive input 


Clock for receive input for serial port 


Frame synchronization pulse for receive input 
Serial data transmit output 
Clock for transmit output for serial port 


Frame synchronization pulse for transmit. Configurable as either an input or an output. 


T1/0/Z denotes input/output/high-impedance state. 
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functional! block diagram 
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INT(2-0) - (4x16) | (an axa 
7 Ts —_ as TTT 
A15-A0 ee 
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~ DATA BUS 
TRI16) a 


SHIFTER(0-16) 
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FROM IR 


SHIFTER(-6,0, 1,4) 


BLOCK B2 
(32 x 16) 


DATA RAM 
BLOCK B1 
(256 x 16) 


DATA/PROG 
RAM (256 x 16) 
BLOCK BO 


| ACCHI16}| ACCL(16) 


SHIFTERS(O,1,4) 


DATA BUS 
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architecture 


The SMJ32020 architecture is based upon that of the TMS32010, a first-generation member of the TMS320 
family. The SMJ32020 increases performance of DSP algorithms through innovative additions to the TMS 
architecture. SMJ32010 source code is upward-compatible with SMJ32020 source code and can be 
assembled using the TMS32020 Macro Assembler. 


Increased throughput on the SMJ32020 for many DSP applications is accomplished by means of single- 
cycle multiply/accumulate instructions with a data move option, five auxiliary registers with a dedicated 
arithmetic unit, and faster I/O necessary for data-intensive signal processing. 


The architectural design of the SMJ32020 emphasizes overall speed, communication, and flexibility in 
processor configuration. Control signals and instructions provide floating-point support, block-memory 
transfers, communication to slower off-chip devices, and multiprocessing implementations. 


Two large on-chip RAM blocks, configurable either as separate program and data spaces or as two 
contiguous data blocks, provide increased flexibility in system design. Maintaining program memory off- 
chip allows large address spaces from which large programs of up to 64K words can operate at full speed. 
Programs can also be downloaded from slow external memory to high-speed on-chip RAM. A 64K-word 
data memory address space is included to facilitate implementation of DSP algorithms. The VLSI 
implementation of the SMJ32020 incorporates all of these features as well as many others, such as a 
hardware timer, serial port, and block data transfer capabilities. 


32-bit ALU/accumulator 


The 32-bit Arithmetic Logic Unit (ALU) and accumulator perform a wide range of arithmetic and logical 
instructions, the majority of which execute in a single clock cycle. The ALU executes a variety of branch 
instructions dependent on the status of the ALU. or a single bit in a word. These instructions provide the 
following capabilities: 


¢ Branch to an address specified by the accumulator 
« Normalize fixed-point numbers contained in the accumulator 
¢« Test a specified bit of a word in data memory. 


One input to the ALU is always provided from the accumulator, and the other input may be provided from 
the Product Register (PR) of the multiplier or the input scaling shifter which has fetched data from the 
RAM on the data bus. After the ALU has performed the arithmetic or logical operations, the result is stored 
in the accumulator. 


The 32-bit accumulator is split into two 16-bit segments for storage in data memory. Additional shifters 
at the output of the accumulator perform shifts while the data is being transferred to the data bus for 
storage. The contents of the accumulator remain unchanged. 


scaled shifter 


The SMJ32020 scaling shifter has a 16-bit input connected to the data bus and a 32-bit output connected 
to the ALU. The scaling shifter produces a left shift of O to 16 bits on the input data, as programmed 
in the instruction. The LSBs of the output are filled with zeroes, and the MSBs may be either filled with 
zeroes or sign-extended, depending upon the status programmed into the SXM (sign-extension mode) bit 
of status register STO. 
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16 x 16-bit parallel multiplier 


The SMJ32020 has a two’s-complement 16 x 16-bit hardware multiplier, which is capable of computing 
a 32-bit product in a single machine cycle. The multiplier has the following two associated registers: 


¢ A 16-bit Temporary Register (TR) that holds one of the operands for the multiplier, and 
« A 32-bit Product Register (PR) that holds the product. 


Incorporated in the instruction set are single-cycle multiply/accumulate instructions that allow both operands 
to be processed simultaneously. The data for these operations resides in the on-chip RAM blocks and can 
be transferred to the multiplier each cycle via the program and data buses. 


Four product shift modes are available at the Product Register (PR) output that are useful when performing 
multiply/accumulate operations, fractional arithmetic, or justifying fractional products. 


timer 


The SMJ32020 provides a memory-mapped 16-bit timer for control operations. The on-chip timer (TIM) 
register is a down counter that is continuously clocked by an internal clock. This clock is derived by dividing 
the CLKOUT1 frequency by 4. A timer interrupt (TINT) is generated every time the timer decrements to 
zero. The timer is reloaded with the value contained in the period (PRD) register within the same cycle 
that it reaches zero so that interrupts may be programmed to occur at regular intervals of 4 x (PRD) cycles 
of CLKOUT1. 


memory control 


The SMJ32020 provides a total of 544 16-bit words of on-chip data RAM, divided into three separate 
blocks (BO, B1, and B2). Of the 544 words, 288 words (blocks B1 and B2) are always data memory, and 
256 words (block BO) are programmable as either data or program memory. A data memory size of 544 
words allows the SMJ32020 to handle a data array of 512 words (256 words if on-chip RAM is used 
for program memory), while still leaving 32 locations for intermediate storage. When using block BO as 
program memory, instructions can be downloaded from external program memory into on-chip RAM and 
then executed. 


When using on-chip program RAM or high-speed external program memory, the SMJ32020 runs at full 
speed without wait states. However, the READY line can be used to interface the device to slower, less- 
expensive external memory. Downloading programs from slow off-chip memory to on-chip program RAM 
speeds processing while cutting system costs. 


The SMJ32020 provides three separate address spaces for program memory, data memory, and I/O. The 
on-chip memory is mapped into either the 64K-word data memory or program memory space, depending 
upon the memory configuration. The CNFD (configure block BO as data memory) and CNFP (configure 
block BO as program memory) instructions allow dynamic configuration of the memory maps through 
software. Regardless of the configuration, the user may still execute from external program memory. 


Six registers are mapped into the data memory space: a serial port data receive register, serial port data 
transmit register, timer register, period register, interrupt mask register, and global memory allocation 
register. 
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PROGRAM 
0(>0000) 
: INTERRUPTS 
AND RESERVED 
(EXTERNAL) 
31(>001F) 
32(>0020) 


EXTERNAL 


65,535(>FFFF) 


PROGRAM 


0(>0000) 


INTERRUPTS 
AND RESERVED 
(EXTERNAL) 


31(>001F) 
32(>0020) 


EXTERNAL 


DATA vO 
0(>0000) 0 
ON-CHIP 
MEMORY-MAPPED EXTERNAL 
REGISTERS 
5(>0005) 15 
RESERVED 
95(>005F) 
96(>0060) ON-CHIP 
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RESERVED PAGES 1-3 
511(>O1FF) 
512(>0200) 
oxen Le maces as 
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1023(>03FF) 
1024(>0400) 
65,535(>FFFF) 
(a) ADDRESS MAPS AFTER A CNFD INSTRUCTION 
DATA vo 
0(>0000) 0 
ON-CHIP 
MEMORY-MAPPED EXTERNAL 
REGISTERS 
5(>0005) 15 
RESERVED 
95(>005F) 
96(>0060) ON-CHIP 
127(>007F) ene 
RESERVED PAGES 1-3 
511(>01FF) 


65,279(>FEFF) 
65,280(>FFOO) 


ON-CHIP 
BLOCK BO 


65,535(>FFFF) 


EXTERNAL 
65,535(>FFFF) 


(b} ADDRESS MAPS AFTER A CNFP INSTRUCTION 


512(>0200) 


aml 
767(>02FF) EXIST 
768(>0300) BaeesGe 


PAGES 4-5 


PAGES 6-7 


1024(>0400) 
PAGES 8-511 


FIGURE 1. MEMORY MAPS 
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interrupts and subroutines 


The SMJ32020 has three external maskable user interrupts INT2-INTO, available for external devices 
that interrupt the processor. Internal interrupts are generated by the serial port (RINT and XINT), by the 
timer (TINT), and by the software interrupt (TRAP) instruction. Interrupts are prioritized with reset (RS) having 
the highest priority and the serial port transmit interrupt (XINT) having the lowest priority. All interrupt 
locations are on two-word boundaries so that branch instructions can be accommodated in those locations 
if desired. 


A built-in mechanism protects multicycle instructions from interrupts. If an interrupt occurs during a 
multicycle instruction, the interrupt is not processed until the instruction is completed. This mechanism 
applies both to instructions that are repeated or become multicycle due to the READY signal. 


external interface 


The SMJ32020 supports a wide range of system interfacing requirements. Program, data, and I/O address 
spaces provide interface to memory and I/O, thus maximizing system throughput. I/O design is simplified 
by having !/O treated the same way as memory. 1/O devices are mapped into the I/O address space using 
the processor's external address and data buses in the same manner as memory-mapped devices. Interface 
to memory and I/O devices of varying speeds is accomplished by using the READY line. When transitions 
are made with slower devices, the processor waits until the other device completes its function and signals 
the processor via the READY line. Then, the SMJ32020 continues execution. 


A serial port provides communication with serial devices, such as codecs, serial A/D converters, and other 
serial systems. The interface signals are compatible with codecs and many other serial devices with a 
minimum of external hardware. The serial port may also be used for intercommunication between processors 
in multiprocessing applications. 


The serial port has two memory-mapped registers: the data transmit register (DXR) and the data receive 
register (DORR). Both registers operate in either the byte mode or 16-bit word mode, any may be accessed 
in the same manner as any other data memory location. Each register has an external clock, a framing 
synchronization pulse, and associated shift registers. One method of multiprocessing may be implemented 
by programming one device to transmit while the others are in the receive mode. 


multiprocessing 


The flexibility of the SMJ32020 allows configurations to satisfy a wide range of system requirements and 
can be used as follows: 


e A standalone processor 

¢ A multiprocessor with devices in parallel 

« A stave/host multiprocessor with global memory space 

« A peripheral processor interfaced via processor-controlled signals to another device. 


For multiprocessing applications, the SMJ32020 has the capability of allocating global data memory space 
and communicating with that space via the BR (bus request) and READY control signals. Global 
memory is data memory shared by more than one processor. Global data memory access must be arbitrated. 
The 8-bit memory-mapped GREG (global memory allocation register) specifies part of the SMJ32020’s 
data memory as globa! external memory. The contents of the register determine the size of the global 
memory space. If the current instruction addresses an operand within that space, BR is asserted to request 
control of the bus. The length of the memory cycle is controlled by the READY line. 


The SMJ32020 supports DMA (direct memory access) to its external program/data memory using the 
HOLD and HOLDA signals. Another processor can take complete control of the SMJ32020's external 
memory by asserting HOLD low. This causes the SMJ32020 to place its address, data, and control 
lines in a high-impedance state, and assert HOLDA. 
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instruction set 


The SMJ32020 microprocessor implements a comprehensive instruction set that supports both numeric- 
intensive signal processing operations as well as general-purpose applications, such as multiprocessing 
and high-speed control. The SMJ32010 source code is upward-compatible with SMJ32020 source code. 


For maximum throughput, the next instruction is prefetched while the current one is being executed. Since 
the same data lines are used to communicate to external data/program or I/O space, the number of cycles 
may vary depending upon whether the next data operand fetch is from internal or external program memory. 
Highest throughput is achieved by maintaining data memory on-chip and using either internal or fast external 
program memory. 


addressing modes 


The SMJ32020 instruction set provides three memory addressing modes: direct, indirect, and immediate 
addressing. 


Both direct and indirect addressing can be used to access data memory. In direct addressing, seven bits 
of the instruction word are concatenated with the nine bits of the data memory page pointer to form the 
16-bit data memory address. |ndirect addressing accesses data memory through the five auxiliary registers. 
tn immediate addressing, the data is based on a portion of the instruction word(s). 


In direct memory addressing, the instruction word contains the lower seven bits of the data memory address. 
This field is concatenated with the nine bits of the data memory page pointer to form the full 16-bit address. 
Thus, memory is paged in the direct addressing mode with a total of 512 pages, each page containing 
128 words. 


Five auxiliary registers (ARO-AR4) provide flexible and powerful indirect addressing. To select a specific 
auxiliary register, the Auxiliary Register Pointer (ARP) is loaded with either 0, 1, 2, 3, or 4 for ARO through 
AR4, respectively. 


There are five types of indirect addressing: auto-increment or auto-decrement, post-indexing by either adding 
or subtracting the contents of ARO, or single indirect addressing with no increment or decrement. All 
operations are performed on the current auxiliary register in the same cycle as the original instruction, 
followed by a new ARP value being loaded. 


repeat feature 


A repeat feature, used with instructions such as multiply/accumulates, block moves, 1/O transfers, and 
table read/writes, allows a single instruction to be performed up to 256 times. The repeat counter (RPTC) 
is loaded with either a data memory value (RPT instruction) or an immediate value (RPTK instruction). The 
value of this operand is one Jess than the number of times that the next instruction is executed. Those 
instructions that are normally multicycle are pipelined when using the repeat feature, and effectively become 
single-cycle instructions. 


instruction set summary 


Table 1 lists the symbols and abbreviations used in Table 2, the instruction set summary. Table 2 consists 
primarily of single-cycle, single-word instructions. Infrequently used branch, I/O, and CALL instructions 
are multicycle. The instruction set summary is arranged according to function and alphabetized within each 
functional grouping. The symbol (t) indicates those instructions that are not included in the SMJ32010 
instruction Set. 


wip 
TEXAS 
INSTRUMENTS 


POST OFFICE 8OX 1443 @® HOUSTON, TEXAS 77001 


SMJ32020 
DIGITAL SIGNAL PROCESSOR 


TABLE 1. INSTRUCTION SYMBOLS 


symeo. | MEANING 


4-bit field specifying a bit code 

2-bit field specifying compare mode 

Data memory address field 

Format status bit 

Addressing mode bit 

Immediate operand field 

Port address (PAO through PA15 are predefined 


assembler symbo!s equal to O through 15, 


respectively). 

2-bit field specifying P register output shift code 
3-bit operand field specifying auxiliary register 
4-bit left-shift code 

3-bit accumulator left-shift field 
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TABLE 2. INSTRUCTION SET SUMMARY 
ACCUMULATOR MEMORY REFERENCE INSTRUCTIONS 


NO. INSTRUCTION BIT CODE 


M NIC DESCRIPT 
NEMON! sc 1ON WORDS 


1514131211109 8765 43210 
Absolute value of accumulator 
Add to accumulator with shift 
Add to high accumulator 
Add to low accumulator with 


ooo°0 
ooo 


sign extension suppressed 


°o 
°o 


Add to accumulator with shift specified by 


T register 


Add to accumulator long immediate with shift 
AND with accumulator 

AND immediate with accumulator with shift 
Complement accumulator 

Load accumulator with shift 

Load accumulator immediate short 

Load accumulator with shift specified by T register 
Load accumulator long immediate with shift 
Negate accumulator 

Normalize contents of accumulator 

OR with accumulator 

OR immediate with accumulator with shift 
Store high accumulator with shift 

Store low accumulator with shift 

Subtract from accumulator long immediate with 
shift 

Shift accumulator left 

Shift accumulator right 

Subtract from accumulator with shift 


]—-=.e2ND @ @ @B Dh we — &@ @ HY |] DH 
oo-0o7r7+4 42 0-20@6@ 20 = 
ee oe eae ee ow own OO a = = 
-r-r ao oocooo-aoa0o0o oO 
oor0co0o0°o-0 0002+02 


Conditional subtract 
Subtract from high accumulator 


oooor?r+ = 
24s oe Ow = oa 
oo oo 00 0 
ooo -00- 


Subtract from low accumulator with sign 
extension suppressed 


°o 
° 
° 


Subtract from accumulator with shift specified by 

T register 

Exclusive-OR with accumulator 

Exclusive-OR immediate with accumulator with shift 
Zero accumulator. 

Zero low accumulator and load high accumulator 


ooo0o;970 


Zero accumulator and load low accumulator with 
sign extension suppressed 


TThese instructions not included in the SMJ32010 instruction set. 
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TABLE 2. INSTRUCTION SET SUMMARY (CONTINUED) 


AUXILIARY REGISTERS AND DATA PAGE POINTER INSTRUCTIONS 


oO. NSTRUCTION BIT COD 
MNEMONIC DESCRIPTION y INSTRUCTION BIT CODE 
WORDS 


1514131211109 8 76543210 


11001010 0@CM> 
+R | +92 — > 


Compare auxiliary register with auxiliary register ARO 
Load auxiliary register 

+R + | _——_——_ 
101310001 «RR» 


1 
ie) 
Load auxiliary register immediate short ie) 
) 
0 10 1-—_—)-——_—+> 
1 
0 
oO 
(e) 


Load auxiliary register pointer 

Load data memory page pointer 

Load data memory page pointer immediate 00 «————K—_———_> 
+R 00000000 
101 | «———D——> 


Load auxiliary register long immediate 
Modify auxiliary register 


Store auxiliary register 
T REGISTER, P REGISTER, AND MULTIPLY INSTRUCTIONS 


O. INSTRUC 
MNEMONIC DESCRIPTION N UCTION BIT CODE 
WORDS 


+R | +) > 


ooo 
oorr00+0- 
Se ed 
-oo0 00 00 
--40+4+4+0=20 


1514131211109 8765 43 2 1 
Add P register to accumulator 
Load high P register 
Load T register 
Load T register and accumulate previous product 
Load T register, accumulate previous product, 


oo 0o0- 
ooo 


and move data 

Load T register and store P register in accumulator 
Load T register and subtract previous product 
Multiply and accumulate 

Multiply and accumulate with data move 

Multiply (with T register, store product in P register) 
Multiply immediate 

Load accumulator with P register 

Subtract P register from accumulator 

Set P register output shift mode 

Square and accumulate 


ee so ee ond = a 
oorvr 2 -?00000 


=-O0O-44 00224 4540 
o-oo 0 2+0002+ 


Square and subtract previous product 


tThese instructions not included in the SMJ32010 instruction set. 
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TABLE 2. INSTRUCTION SET SUMMARY (CONTINUED) 


BRANCH CALL INSTRUCTIONS 


MNEMONIC DESCRIPTION NO. INSTRUCTION BIT CODE 
WORDS 


1514131211109 876543210 
<+£$£___p—__> 
0100101 


= 
= 


Branch unconditionally 

Branch to address specified by accumulator 
Branch on auxiliary register not zero 
Branch if TC bit # O 

Branch if TC bit = 0 

Branch if accumulator = O 

Branch if accumulator > O 

Branch on I/O status = O 

Branch if accumulator < O 

Branch if accumulator < O 

Branch if no overflow 

Branch if accumulator # O 

Branch on overflow 

Branch if accumulator 

Call subroutine indirect 

Call subroutine 

Return from subroutine 


NA NNN NN NY NNN NN ND — ND 
o-ofrft7 2222224264 20 2 
o7r-OfFfF27 2 2 2 ep Se eww = O 
—"m~e - odoncodncdococTonceoeo-H-d0c01or=- s+ = = 
emer OP HP oocdoeoerH-oCcoe = 
aeer = OOnfm Hfm=aeHooeooH-=- 
ooocorrr0e0+00++ 0 


1 
CONTROL INSTRUCTIONS 


NO. INSTRUCTION BIT CODE 
MNEMONIC DESCRIPTION WORDS 


1514131211109 876543210 


| 


Test bit 

Test bit specified by T register 
Configure block as data memory 
Configure block as program memory 
Disable interrupt 

Enable interrupt 

Idle until interrupt 

Load status register STO 

Load status register ST1 

No operation 

Pop top of stack to low accumulator 
Pop top of stack to data memory 
Push data memory value onto stack 
Push low accumulator onto stack 
Reset overflow mode 

Repeat instruction as specified by data memory value 
Repeat instruction as specified by immediate value 
Reset sign-extension mode 

Set overflow mode 

Store status register STO 

Store status register ST1 

Set sign-extension mode 

Software interrupt 


-=-=007r+4 420 2 20 0-00 0227-424 40 = 
Ce nn ee ee ee ee ee ee eee 
ooreerOo 00000 020 00 00 00 00 0 


oor feo0o0o0qo0q0o 0 + +7072 2000002 
wae een won er Osan O00 44420 


Pn oon noone en Owns oo aa ow nw a 
ooro0coo0o0orr0qo0coe0coecee0errvraqao0qo0qo0q0o0o0 = 


tThese instructions not inctuded in the SMJ32010 instruction set. 
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MNEMONIC 


TABLE 2. INSTRUCTION SET SUMMARY (CONCLUDED) 


VO AND DATA MEMORY OPERATIONS 


NO. INSTRUCTION BIT CODE 
DESCRIPTION WORDS 


1514131211109 876543210 


= 
= 


Block move from data memory to data memory 
Block move from program memory to data memory 
Data move in data memory 

Format serial port registers 

Input data from port 

Output data to port 

Reset serial port transmit mode 

Reset external flag 

Set serial port transmit mode 

Set external flag 

Table read 

Table write 


2 
2 
1 6 
1 
1 
1 
1 
1 
1 
1 
1 
1 


oo mw Se a ow oe Om a 
eee ow en no Om = a= 
cooooeos~aoao- 
--9o000000- = 
--oeoc000~-0-- 


tThese instructions not included in the SMJ32010 instruction set. 
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development support 


Texas Instruments offers an extensive line of development support products to assist the user in all aspects 
of SMJ320 second-generation-based design and development. These products range from development 
and application software to complete hardware development and evaluation systems such as the XDS/22. 
Table 4 lists the development support products for the second-generation SMJ320 devices. 


System development begins with the use of the SoftWare Development System (SWDS) or Emulator (XDS). 
These tools allow the designer to evaluate the processor’s performance, benchmark time-critical code, 
and determine the feasibility of using a SMJ320 device to implement a specific algorithm. 


Software and hardware can be developed in parallel by using the macro assembler/linker, simulator, and 
SoftWare Development System for software development and the XDS for hardware development. The 
assembler/linker translates the system’s assembly source program into an object module that can be 
executed by the simulator, XDS, or SWDS. The XDS provides realtime in-circuit emulation and is a powerful 
tool for debugging and integrating software and hardware modules. 


Additional support for the SMJ320 products consists of extensive documentation and three-day DSP design 
workshops offered by the Tl Regional Technology Centers (RTCs). The workshops provide hands-on 
experience with the TMS320 development tools. Refer to the 7M@S320 Family Development Support 
Reference Guide for further information about TMS320 development support products and DSP workshops. 
When technical questions arise regarding the TMS320, contact the Texas Instruments TMS320 Hotline, 
(713) 274-2320. 


TABLE 4. TMS320 SECOND-GENERATION SOFTWARE AND HARDWARE SUPPORT 


SOFTWARE TOOLS PART NUMBER 


Macro Assembler/Linker 
VAX VMS TMDS3242210-08 
TI/IBM MS/PC-DOS TADS3242810-02 


Simulator 
VAX VMS TMDS3242211-08 
TVIBM MS/PC-DOS TMDS324281 1-02 


SoftWare Development System (SWDS) TMDS3268821 


Digital Filter Design Package (DFDP) 
IBM PC-DOS DFDP-IBMOO2 


DSP Software Library 
VAX VMS TMDC3240212-18 
TI/IBM MS/PC-DOS TMDC3240812-12 


HARDWARE TOOLS PART NUMBER 


Analog Interface Board (AIB) RTC/EVM320C-06 
XDS/22 Emulator TMDS3262221 


XDS/22 Upgrade 
Customer Upgrade TMDS3282226 


T™MS320 Design Kit TMS320DDK 
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documentation support 


Extensive documentation supports the second-generation TMS320 devices from product announcement 
through applications development. The types of documentation include data sheets with design 
specifications, complete user’s guides, and 750 pages of application reports published in the book, Digita/ 
Signal Processing Applications with the TMS320 Family. An application report, Hardware Interfacing to 
the TMS320C25, is available for that device. 


A series of DSP textbooks is being published by Prentice-Hall and John Wiley & Sons to support digital 
signal processing research and education. The TMS320 newsletter, Details on Signal Processing, is published 
quarterly and distributed to update SMJ320 customers on product information. The TMS320 DSP bulletin 
board service provides access to large amounts of information pertaining to the SMJ320 family. 


Refer to the 7M@S320 Family Development Support Reference Guide for further information about SMJ320 
documentation. To receive copies of second-generation TMS320 literature, call the Customer Response 
Center at 1-800-232-3200. 
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absolute maximum ratings over specified temperature range (unless otherwise noted)t 


Siipply voltage range, VCGF oh tii 25 beet oh eee hay das ca RIM Ue -O0.3Vto7V 
Input voltage:-range is 2 o oeg RA Oe ie a a kee Ee) Ble da i he See beee -O0.3Vto7V 
Output voltage range .......... PME 8. aiuat ahh AEN de ales ENN Ul Sao lance, Gd NAM oh aa td NS deal -0.3Vto7V 
Continuous power dissipation .. 0.0... eee ee eee ee 2.0 W 
Minimum operating free-air temperature... 6... ee ee eee -55°C 
Maximum operating case temperature................ Bd dha er thhtan yy mute Goren cot OM heed 100°C 
Storage temperature range .... 2... . ee ee ee ee eens -—55°C to 150°C 


TStresses beyond those listed under ‘‘Absolute Maximum Ratings’’ may cause permanent damage to the device. This is a stress rating 
only and functional operation of the device at these or any other conditions beyond those indicated in the ‘‘Recommended Operating 
Conditions’’ section of this specification is not implied. Exposure to absolute-maximum-rated conditions for extended periods may affect 
device reliability. 

TAll voltage values are with respect to Vss. 


recommended operating conditions 


Vcc Supply voltage 
Vss_ Supply voltage 


All inputs except CLKIN 
CLKIN 
All inputs except CLKIN 


Vin High-level input voltage 


Vit Low-level input voltage 


IOH High-level output current 


loL Low-level output current 


Tc Operating case temperature 


Ta Operating free-air temperature 


3 
Voc = MIN, lo = MAX 308 [Vv 
=79 20 
Pi input cuwent———SSC*dSSCSCSC* = gpg SSSCSC~CSCSSSCCS YO SSCS~«*O 
Ta = -55°C, Vcc = MAX, fy = MAX F400 | 
= 


Icc Supply current Ta = 25°C, Vcc = MAX, fy = MAX 


[Eo Output capacitance 


SAll typical values are at Vec = 5V, Ta = 26°C. 


y 


d "4 Caution. This device contains circuits to protect its inputs and outputs against damage due to high static voltages or electrostatic 

fields. These circuits have been qualified to protect this device against electrostatic discharges (ESD) of up to 2 kV according 

to MIL-STD-883C, Method 3015; however, it is advised that precautions be taken to avoid application of any voltage higher than maximum 

rated voltages to these high-impedance circuits. During storage or handling, the device leads should be shorted together or the device 

should be placed in conductive foam. In a circuit, unused inputs should always be connected to an appropriate logic voltage level, preferably 

either Vcc or ground. Specific guidelines for handling devices of this type are contained in the publication ‘Guidelines for Handling Electrostatic- 
Discharge Sensitive (ESDS) Devices and Assemblies’’ available from Texas Instruments. 
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CLOCK CHARACTERISTICS AND TIMING 
The SMJ32020 can use either its internal oscillator or an external frequency source for a clock. 


internal clock option 


The internal oscillator is enabled by connecting a crystal across X1 and X2/CLKIN (see Figure 2). The 
frequency of CLKOUT1 is one-fourth the crystal fundamental frequency. The crystal should be fundamental 
mode, and parallel resonant, with an effective series resistance of 30 ohms, a power dissipation of 1 mW, 
and be specified at a load capacitance of 20 pF. 


67 20 | Mie 


Ta = -55°C MIN 
Tc = 100°C MAX 


CRYSTAL 


C1 U Cc 
ai ‘ei 


FIGURE 2. INTERNAL CLOCK OPTION 


external clock option 


An external frequency source can be used by injecting the frequency directly into X2/CLKIN with X71 left 
unconnected. The external frequency injected must conform to the specifications listed in the following table. 


switching characteristics over recommended operating conditions (see Note 1) 


PARAMETER MIN TYP MAX UNIT 
te(C) CLKOUT1/CLKOUT2 cycle time 200 600 


CLKOUT1/CLKOUT2/STRE fall time ft 


CLKOUT1/CLKOUT2/STRB rise time 


twi(CL) CLKOUT1/CLKOUT2 low pulse duration 20-15 2Q° 20+15 


tw(CH) CLKOUT1/CLKOUT2 high pulse duration 20-15 2Q 20+15 


NOTE 1: Q = 1/4tg(¢). 


i 
TEXAS 
INSTRUMENTS 


POST OFFICE 8OX 1443 @® HOUSTON. TEXAS 77001 


SMJ32020 
DIGITAL SIGNAL. PROCESSOR 


timing requirements over recommended operating conditions (see Note 1) 


te(Cl) CLKIN cycle time 
twiCiL) CLKIN low pulse duration, tc(c}) = 50 ns (see Note 2) 


twiCiH) CLKIN high pulse duration, tcjcj) = 50 ns (see Note 2) 
tsu(s) SYNC setup time before CLKIN low 
this) SYNC hold time from CLKIN low 


NOTES: 1. Q = 1/4tgc). 
2. CLKIN duty cycle [tr(ci) + tw{CIH)!/te(cl) must be within 40-60%. Rise and fall times must be less than 10 ns. 
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FIGURE 3. TEST LOAD CIRCUIT 
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MEMORY AND PERIPHERAL INTERFACE TIMING 


switching characteristics over recommended operating conditions (see Note 1) 


rn 
iW Bata writs hold tine fom STAB high ———SSCSCSC~“‘“‘~*dC ts Cd 
tenb| Oats bus starts bag den ate STR ow Wentecvein ———SSSCSC*dCSCi 
tanec) MC valafomeixours——SSCSCSC~C~SSC*dC( a | 


tThese values were derived from characterization data and are not tested. 
NOTES: 1. Q = 1/4tg(¢), 
3. A15-A0, PS, DS, fS, R/W, and BR timings are all included in timings referenced as ‘‘address.”’ 
4. Delays between CLKOUT1/CLKOUT2 edges and STRB edges track each other, resulting in tw(SL) and tw(SH) being 20 
with no wait states. 


timing requirements over recommended operating conditions (see Note 1) 


ae dp i Se INNES TMA 
ta(A Read data access time from address time (read cycle, see Notes 3 and 5) 30-75 
tsu(D)R Data read setup time before STRB high | 40t—s—idYss 


Q-40! 
2a-s0'| ns _| 


TThese values were derived from characterization data and are not tested. 

NOTES: 1. Q = 1/4tg(c¢). 
3. A15-A0, PS, DS, IS, R/W, and BR timings are all included in timings referenced as ‘‘address.”’ 
5. Read data access time ts defined as taiA) = tsu(A) + tw(SL) — tsu(D)R- 
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RS, INT, BIO, AND XF TIMING 


switching characteristics over recommended operating conditions (see Notes 1 and 6) 


Ftaingy _CLKOUTI low to reset state entered SSCS~S—SSSC 
Peawacky CLKOUTI toIACR vais SSCSCSC~—~SCSCSY S| 
td(XF) XF valid before falling edge of STRB | a-35 =—ti(itsé‘*YL:éCissSd 


tThese values were derived from characterization data and are not tested. 
NOTES: 1. Q = 1/4tgc). 
6 RS, INT, and BIO are asynchronous inputs and can occur at any time during a clock cycle. However, if the specified setup 
time is met, the exact sequence shown in the timing diagrams will occur. INT/BIO fall time must be less than 10 ns. 


timing requirements over recommended operating conditions (see Notes 1 and 6) 


tsultN) INT/BIO/RS setup before CLKOUT?1 high 


th(iN) INT/BIO/RE hold after CLKOUT1 high eee 


twi(iN) INT/BIO low pulse duration tec) 


twiRS) RS low pulse duiation 3teic) 


NOTES: 1. Q = 1/4tg(¢). 
6. RS, INT, and BIO are asynchronous inputs and can occur at any time during a clock cycle. However, if the specified setup 
time is met, the exact sequence shown in the timing diagrams will occur. INT/BIO fall time must be less than 10 ns. 
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"4° 0;0°0'0"0' 60°00 0066+ XXX) OOOO 
TACK QO RCN Porarwreelees 


SERIAL PORT SOCORRO 
CONTROLS? SXRRRRREKRAR KKK RK KR KK 


TControl signals are DS, TS; R/W and XF. 
+Serial port controls are DX and FSX. 
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HOLD TIMING 


switching characteristics over recommended operating conditions (see Note 1) 


NOTES: 1. Q = 1/4tgi¢). 


7. A15-A0, PS, DS, IS, STRB, and R/W timings are all included in timings referenced as ‘‘address."’ 


timing requirements over recommended operating conditions (see Note 1) 


tThese values were derived from characterization data and are not tested. 
NOTE 1: Q = 1/4tgi¢). 


HOLD timing (part A) 


1 


STRB 


HOLD 


A15-A0 


a ME tdis(C1L-A) 


D15-D0 


| 
N N+1 N/A | 


FETCH 


EXECUTE 
—____ i 
HOLDA TT 

| 


THOLD is an asynchronous input and can occur at any time during a clock cycle. If the specified timing is met, the exact sequence shown 
will occur; otherwise, a delay of on CLKOUT2 cycle will occur. 
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HOLD timing (part B) 


CLKOUT1 | 
CLKOUT2 
STRB 
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EXECUTE 9 9 +e 


HOLDA —> h«— td(HH-AH) 
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THOLD is an asynchronous input and can occur at any time during a clock cycle. If the specified timing is met, the exact sequence shown 
will occur; otherwise, a delay of on CLKOUT2 cycle will occur. 
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switching characteristics over recommended operating conditions (see Note 1) 


ta(FL-OX) OX valid after FSX falling edge {TXM = O, see Note 8) 


NOTES: 1. Q = 1/4tg(¢). 
8. The last occurrence of FSX falling and CLKX rising. 


timing requirements over recommended operating conditions (see Note 1) 


Serial port clock (CLKX/CLKR}) high pulse duration (see Note 9) 


twiSCK) 


tsu(FS) __FSX/FSR setup time before CLKX/CLKR falling edge (TXM = 0) 
[Ghion) OR hold ime after CLKR fang edge SSCS SSCS 


NOTES: 1. Q = 1/4tgic). 
9. The duty cycle of the serial port clock must be within 40-60%. Serial port clock (CLKX/CLKR) rise and fall times must 
be less than 50 ns. 


serial port receive timing 
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serial port transmit timing 
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MECHANICAL DATA 
68-pin GB pin grid array ceramic package 


TOP VIEW 
28,448 (1.120) 


27,432 (1.080) 
17,02 (0.670) 
| NOM "| 


THERMAL RESISTANCE CHARACTERISTICS 


PARAMETER 


28,448 (1.120) 


27,432 (1.080) 
R Junction-to-free-air 
JA thermal resistance TESTE S70} 


Junction-to-case 


R 
JC thermal resistance 


4,953 (0.705) 
2,794 (0.110) 1,397 (0.055) 
+” MAX 
|i 
3,302 (0.130) 0,508 (0.020) | 1,270 (0.050) DIA 
2.794 (0.110) 0.406 (0.015) NOM 
2,54 (0.100) 
2.54 (0.100) BOTTOM VIEW 
L 
K aa (0.100) 
r TP. 
H 
G 
F 
E 
D 
1,524 (0.060) 
Fane 
B|\O©O}O©OQOOOOOO® 4 PLACES 
A\ ©©OOG©0©6 G—\+ 


12 3 4 &§ 6 7 8 9 1011 ¥ 1,27 (0.050) 
NOM 


ALL LINEAR DIMENSIONS ARE IN MILLIMETERS AND PARENTHETICALLY IN INCHES 
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FJ ceramic leaded chip carrier packages 


20,52 (0.808) 
20,19 (0.795) 


i 
a 


0,51 (0.020) 
0,25 (0.010) 


25,40 (1.000) 
24,89 (0.980) 


24,51 (0.950) 
23,88 (0.910) 


0,89 (0.035) 


THERMAL RESISTANCE CHARACTERISTICS 


| PARAMETER | MAX|_UNIT | 

PsA iowa mens | 22 | “| 
thermal resistance 

foee weeps Rice 
thermal resistance 


2,41 (0.095) 


soiled 


0,28 (0.011) 
0,18 (0.007) 
0,48 (0.019) LID 
0,33 (0.013) 


0,38 (0.015) 
0,13 (0.005) 


r 


0,81 (0.032) 
0,58 (0.023) 


iri | 0,64 (0.025) 
0.76 (0.030) a | ase (0.140) 


ALL DIMENSIONS ARE IN MILLIMETERS AND PARENTHETICALLY IN INCHES 
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C. TMS320C2x System Migration 


This appendix contains information necessary to upgrade a first-generation 
TMS320 (TMS320C1x) program to a TMS32020-based system or a 
TMS32020 program to a TMS320C25-based system. The information con- 
sists of a detailed list of the programming differences and hardware and timing 
differences between the respective processors. 


The two major sections are as follows: 


e TMS320C1x to TMS32020 System Migration (Section C.1 on page 
C-2) 


®@ TMS32020 to TMS320C25 System Migration (Section C.2 on page 
C-4) 


Appendix C - TMS320C1x to TMS32020 System Migration 


C.1 TMS320C1x to TMS32020 System Migration 


C-2 


This section lists the programming differences that should be considered in 
migrating from a TMS320C1x to a TMS32020 processor. 


1) Instructions are compatible only at the mnemonic level. TMS320C1x 
source programs should be reassembled using a TMS32020 assembler 
before execution. 


2) The memory map on the TMS32020 is different from the memory map 
on the TMS320C1x. Page 0 of the TMS32020’s data memory map 
contains only block B2 (32 words) and the memory-mapped registers. 
The primary on-chip RAM blocks BO and B1 reside on pages 4-7 when 
all RAM is configured as data memory. It should be noted that there 
may be cases in TMS320C1x programs where the BANZ instruction has 
been used to implement both a loop counter and a memory address 
pointer for tables based at location 0 in memory. Since blocks BO, B1, 
and B2 in the TMS32020 are located at addresses other than 0, pro- 
grams being migrated from the TMS320C1x to the TMS32020 should 
implement this type of BANZ loop using two separate auxiliary registers, 
one for loop count and one for memory address. 


3) The SXM bit must be set to 1 and the PM bits must be set to 0 to ensure 
that TMS32020 CALU operations behave in the same manner as the 
TMS320C1x. The SXM and PM bits are unaffected by a reset and are in 
a random State after powerup. 


4) The organization of status register STO is different on the two processors 
as shown below. 


TMS320C1x Status Register STO: 


16-14 998 92! AT! 40) 8 8 6 OB 
|OV}OVM]INTM{ 1. 31. 1 «1 /ARP{ 106101 1°14 ~=4 = 1 


TMS32020 Status Register STO: 


15 14 13 12 #11 10 9 8 7 6 5 4 3 2 1 #9 


arP_Lovjovm[ 1 [wr] SCPC 


5) In the direct addressing mode, the SST (store status register STO) in- 
struction of the TMS32020 sets DP = 0, rather than 1 as on the 
TMS320C1x. The SST1 instruction also sets DP = 0 in the direct ad- 
dressing mode. Note that in the direct addressing mode, data memory 
addressing values should be between 96 and 127 to store the status 
registers in block B2. 


6) When modifying the contents of the current auxiliary register in the in- 
direct addressing mode on the TMS32020, the SAR (store auxiliary 
register) instruction for ARn when n = ARP stores the value of the aux- 
iliary register contents BEFORE it is incremented, decremented, or in- 
dexed by ARO. The TMS320C1x stores the incremented/decremented 
value. 


to 
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7) 


8) 


9) 


10) 


11) 


12) 


13) 


14) 


All of the TMS32020 branch and call instructions, except for BACC 
(branch to address specified by accumulator) and CALA (call subroutine 
indirect), allow both auxiliary register and auxiliary register pointer 


(ARP) modification in the seven LSBs of the opcode. 


The SACL (store low accumulator with shift) instruction on the 
TMS32020 allows shift codes of 0, 1, and 4. 


A multiplication of >8000 x >8000 on the TMS32020 yields the correct 
result of >40000000, not >COQ000000 as on the TMS320C1x. 


The multiply instructions, MPY and MPYK, are not interrupt-protected 
on the TMS32020 since the capability now exists to restore the P reg- 
ister directly. 


The IN and OUT opcodes now have a 4-bit port address to allow for a 
total of 16 I/O ports on the TMS32020. 


A TBLW (table write) instruction on the TMS32020 to program memory 
locations 0-7 can be distinguished externally from an OUT instruction 
to port addresses 0-7 via the PS and IS (program and I/O space select) 
strobes. 


The SUBC (conditional subtract) instruction is a true single-cycle in- 
struction on the TMS32020 and can be used with the repeat in- 
structions, RPT or RPTK. On the TMS320C1x, SUBC cannot be 
followed immediately by another instruction that uses the accumulator. 


When modifying the auxiliary registers in the indirect addressing mode 
on the TMS32020, the auxiliary registers act as 16-bit, rather than 8-bit 
counters (i.e., ‘wraparound’ occurs modulo 2'%, instead of modulo 28 
as on the TMS320C1x). When used with the BANZ (branch on auxiliary 
register not zero) instruction, the auxiliary registers on the TMS32020 
act as 16-bit counters, rather than 9-bit counters as on the TMS320C1x. 
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C.2 TMS32020 to TMS320C25 System Migration 


This section lists the programming, hardware, and timing differences that 
should be considered in migrating from the TMS32020 to the TMS320C25. 


C-4 


1) 


2) 


3) 


5) 


6) 


7) 


8) 


9) 


Instructions are fully compatible at the object code level. TMS32020 
object (memory image) code can be used directly on the TMS320C25 
processor. 


Instructions are compatible at the source code level. The NORM in- 
struction that previously had no operands now has an optional operand 
to define the auxiliary register modification. Any comments on the same 
line in the source code file will be interpreted as the operand if no other 
operand is specified. NORM instructions should be modified to specify 
the default operand, *+. 


When zero is loaded into the accumulator and the NORM instruction is 
executed, the auxiliary register (ARx) on the TMS320C25 is modified 
and the TC is set on the first execution. On the TMS32020, the auxiliary 
register (ARx) is incremented each execution cycle and the TC is not set. 


Execution cycle timings of instructions have been modified. Most 
TMS320C25 instructions execute in a single machine cycle. The number 
of cycles for some multicycle instructions have been changed. Refer to 
Appendix D for detailed information on instruction cycle timings. By 
following the entries in this appendix, the key timing differences can be 
noted. 


The IDLE instruction automatically sets the INTM bit in status register 
STO to a zero. This assures that an external interrupt will “wake up’ the 
processor. The instruction also requires three memory cycles to execute 
on the TMS320C25 rather than one as on the TMS32020. 


In general, all branch, call, and return instructions that reload the pro- 
gram counter (PC) should be counted as three-cycle instructions when 
evaluating code execution timings on the TMS320C25. 


When an interrupt occurs, one additional instruction cycle will be pres- 
ent on the TMS320C25 prior to interrupt acknowledge. When the device 
is released from the hold mode, there will be one additional cycle pre- 
ceding the first valid memory fetch. 


The store instructions (SACH, SACL, etc.) execute in one less cycle on 
the TMS320C25 than on the TMS32020 when data is stored to external 
data memory. 


The MAC and MACD instructions require one extra cycle, going from 
three to four cycles. The extra cycle is in the instruction read and setup 
overhead, and repeated execution will be one cycle per execution as on 
the TMS32020. 
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10) The delay for a new memory configuration to become effective when 
using the CNFD or CNFP instructions on the TMS320C25 is two in- 
struction fetches (for single-cycle instructions) when executing from 
external memory or internal ROM, as compared to one instruction fetch 
for the TMS32020. Thus, on the TMS320C25, a CNFP instruction must 
be placed at location 65277 if execution is to continue from the first lo- 
cation in block BO. When execution is from internal RAM on the 
TMS320C25, however, this delay is one instruction fetch as on the 


TMS32020. 


11) The timer on the TMS320C25 is clocked by CLKOUT1 and counts PRD 
+ 1 CLKOUT1 cycles, whereas the timer on the TMS32020 is clocked 
by CLKOUT1/4 and counts 4 x PRD cycles. Therefore, to count an 
equivalent amount of time on the TMS320C25 using the same input 
clock frequency, PRD values from the TMS32020 must first be multi- 
plied by four and then decremented by one. If different input clock fre- 
quencies are used, this must also be accounted for by multiplying the 
PRD value for the TMS320C25 obtained above by the ratio of the 
TMS320C25 input clock frequency to the TMS32020 input clock fre- 


quency. 


12) To simplify device timing descriptions, the internal clock phase reference 
numbers have been redefined in the TMS320C25. The new clock phase 
definitions have quarter-phase 1 (Q1) beginning a bus cycle, as op- 
posed to a cycle beginning with Q3 as in the TMS32020. Note that no 
changes have been made to any of the device logic; the clock phases 


have merely been renamed. 


13) The effect of the SYNC input, although functionally the same on the 
TMS32020 and TMS320C25, is delayed by two cycles on the 
TMS320C25 from that of the TMS32020. Accordingly, the exact tim- 
ings produced with the application of SYNC on the two devices may 


differ depending on the clock phase in which SYNC is applied. Due to 
the two-cycle offset between the clock phase definitions on the two 
devices (see the previous paragraph) and the two-cycle delay in the ef- 
fect of SYNC on the TMS320C25, the clock timings produced when the 
two devices are running in synchronization are identical. That is, a 
TMS32020 and a TMS320C25 can be operated together in synchroni- 


Zation in a system using the same SYNC input. 


14) On the TMS320C25, both the timer (TIM) and period (PRD) registers 
are initialized to >FFFF on reset, while on the TMS32020, only the TIM 


register is initialized. 


15) Several bits (C, HM, and FSM) have been added to status register ST1 


on the TMS320C25, as shown below. 
TMS32020 Status Register ST1: 


14 13 #12 = «11 


Pane [ene] re [Sx 1 tae po FJ 


TMS320C25 Status Register ST1: 


15 14 13 12 11 10 


Pans [ene] te [exw] ¢ [1 1 Taw [rem Dr [rope Fa 
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16) 


The FSM, HM, and C status register bits are initialized by reset and are 
all set to one when reset occurs. Note that the new bits are assigned 
polarities in such a way that the values of the corresponding bits on the 
TMS32020 invoke a TMS32020-like operation on the TMS320C25. 


The SXM and PM status register bits that were previously uninitialized 
on the TMS32020 are now initialized by reset on the TMS320C25. 
When the TMS320C25 is reset, SXM is set to one, and the PM bits are 
set to zero. 


Four differences between the serial ports on the TMS32020 and 
TMS320C25 that impact system migration are: 


a) The double-buffering on the TMS320C25 serial port greatly in- 
creases the amount of time available for processing serial port in- 
terrupts and affects how the FSR and FSX pulse are used. As a 
result of the double-buffering, both edges of the FSR and FSX 
pulses are used on the TMS320C25 instead of only the falling 
edge, as is the case on the TMS32020. 


On the TMS32020, the falling edge of the FSX pulse is used to 
Start transmission of the data present in the DXR (transmit regis- 
ter). Likewise, the falling edge of the FSR pulse is used to start 
reception of data into the DRR (receive register). The limitations 
on the FSR and FSX pulses are a minimum setup time (20 ns) and 
a minimum hold time (20 ns). Once serial port operation begins, 
the receipt or transmission of the register’s contents, either 8 or 16 
bits, is completed even if the FSR or FSX signals change to a logic 
high level. A new transfer of data into the DRR or out of the DXR 
only begins when the next falling edge of the FSR or FSX pulses 
occurs. 


On the TMS320C25, the double-buffering affects the use of the 
FSR and FSX pins and consequently the serial port operation itself. 
For the transmit operation, the TMS320C25 provides a separate 
XSR (transmit shift register), necessitating the use of the rising 
edge of the FSX pulse. Data is transferred from the DXR to the XSR 
on the first falling CLKX (serial transmit clock) following a rising 
FSX. At this point, the data is in the XSR and waiting to be shifted 
out or transmitted. Transmission begins on the first falling CLKX 
following the falling FSX, and continues with the subsequent bits 
in the XSR as long as the FSX signal remains low. If the FSX sig- 
nal goes high before the last transmission has completed, the 
contents of the DXR are transferred to the XSR and the previous 
transmission is aborted. Transmission of this new information be- 
gins after the FSX signal goes low again. 


Similarly for the receive operation, the TMS320C25 has a separate 
RSR (receive shift register). In this case, the data is transferred 
from the RSR to the DRR when the last bit has been received. 
Therefore, if a new transfer is initiated by toggling the FSR pin, the 
previous reception is aborted and the contents of RSR are not 
transferred to DRR. 


Consequently, there is one additional limitation on the FSR and 
FSX pulses on the TMS320C25. FSR and FSX must have a mini- 
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b) 


Cc) 


qd) 


mum low pulse duration to allow the complete transfer of all 8 or 
16 bits of data into and out of RSR and XSR, respectively. 


Unlike the TMS32020, loading the DXR does not interfere with 
transmission. There is no restriction on when the DXR can be 
loaded when using external FSX. Correspondingly, DRR may be 
read at any time during the reception of the current data, extending 
the time allowed to respond to the receive interrupt and to read the 
previous word of data. 


The fully static operation of the TMS320C25 effectively places no 
lower limit on serial port clock frequency. 


Serial port interrupts are generated half of a CLKR or CLKX cycle 
later on the TMS320C25 than they are on the TMS32020. Spe- 
cifically, on the TMS32020, RINT and XINT are generated on the 
falling edge of CLKR and CLKX, respectively, during transfer of the 
last bit. On the TMS320C25, RINT and XINT are generated on the 
rising edge of CLKR or CLKX after the last bit has been transferred. 
This should not be critical for TMS32020 programs running on the 
TMS320C25 since double-buffering of the serial port on the 
TMS320C25 allows more time for processing of serial port inter- 
rupts. Some modification of TMS32020 programs may, however, 
be required to take advantage of the double-buffering, depending 
on how serial port interrupt servicing is implemented. 


The DRR behaves differently when operating the TMS320C25 se- 
rial port in byte mode than it does on the TMS32020. On the 
TMS32020, the contents of the most-significant byte of DRR re- 
main unchanged once byte mode is initiated by executing a FORT 
instruction. On the TMS320C25,’ however, each time a new byte 
is received, the previous contents of the least-significant byte of 
DRR are transferred to the most significant byte of DRR. 


Figure C-1 illustrates the behavior of DRR on both the TMS32020 
and the TMS320C25 processors. 
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TMS320C25 TMS32020 


MSB LSB MSB LSB 
Initial 
Conditions 


After 1st Receive 
(Byte ’A’) 


After 2nd Receive 
(Byte B’) 


Etc. 


Figure C-1. Serial Port System Migration 
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D. Instruction Cycle Timings 


This appendix details the instruction cycle timings for the TMS32020 and 
TMS320C25 processors. Instructions for each device are first listed in a table 
according to cycle classification. Then each class of instructions is listed in 
another table(s), showing the number of cycles required for a given 
TMS320C2x instruction to execute in a given memory configuration when 
executed as a single instruction or in the repeat mode. The column headings 
in the tables indicate the program source location (PI, PE, or PR) and data 
destination or source (DI or DE), defined as follows: 


PI The instruction executes from internal program memory (RAM). 
PR __ The instruction executes from internal program memory (ROM). 
PE The instruction executes from external program memory. 

DI The instruction executes using internal data memory. 

DE The instruction executes using external data memory. 


The number of cycles required for each instruction is given in terms of the 
program/data memory and !/O access times as defined in the following listing: 


p Program memory wait states. Represents the number of clock cycles the 
device waits for external program memory to respond to an access. 
Tac is the access time, in nanoseconds, (maximum) required by the 
TMS320C2x for an external memory access to be made with no wait 
states. Tmem is the memory device access time, and Tp is the clock pe- 
riod (4/crystal frequency). 


p= 0; If Tmem S Tac 
p=1; If Tac <Tmem S$ (T p + Tac) 
p = 2; If (Tp + Tac)< T mem S$ (Tp x 2 + Tac) 
p=k,; If [Tp x (k-1) + Tac]< T mem S (Tp x k + Tac) - 
d Data memory wait states. Represents the number of cycles the device 


must wait for external data memory to respond to an access. This 
number is calculated in the same way as the p number. 


i 1/O memory wait states. Represents the number of cycles the device 
must wait for external |/O memory to respond to an access. This num- 
ber is calculated in the same way as the p number. 


Other abbreviations used in the tables and their meanings are as follows: 


br Branch from ... 

int Internal program memory. 

INT Interrupt. 

ext External program memory. 

n The number of times an instruction is executed when using the RPT or 
RPTK instruction. 
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D.1 TMS32020 Instruction Cycle Timings 


Table D-1 lists the TMS32020 instructions according to cycle classification. 
Table D-2 shows the number of cycles required for a given TMS32020 in- 
struction to execute in a given memory configuration when executed as a 
single instruction or in the repeat mode, respectively. 


Table D-1. TMS32020 Instructions by Cycle Class 


CLASS INSTRUCTION 
= ADD ADDH ADDS ADDT AND _ BIT BITT DMOV LAC LACT 
BACC CALA POP PUSH RET TRAP 
(BACC, CALA, RET, TRAP not repeatable) 


LAR LDP LPH LST LST1_ LT LTA LTD LTP LTS 

MPY OR RPT SQRA SORS SUB  SUBC SUBH SUBS  SUBT 
XOR _ZALH ZALS__(RPT not repeatable) 

VIN. OUR 

| IX | TBLR  TBLW 

x __ BLKD 


LARK LARP LDPK MAR MPYK NEG NOP NORM PAC ROVM 
RPTK RSXM RTXM_ RXF SFL SFR SOVM SPAC SPM SSXM 
STXM = SXF ZAC 

(LACK, LARK, LDPK, MPYK, RPTK, SPM, ZAC not repeatable) 


ADLK ANDK LALK LRLK ORK ~-SBLK- XORK (all not repeatable) 
MAC MACD 


ae BBNZ BBZ BGEZ BGZ BIOZ BLEZ BLZ BNV 
ae BZ CALL = (all not repeatable) 


SACH SACL SAR SST SST1 
ABS APAC CMPL CMPR CNFD CNFP_ ODINT ~ EINT FORT LACK 
POPD PSHD 

Xl IDLE (not repeatable) 
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Table D-2. TMS32020 Instruction Cycle Timings 


| on || 2ntnd [inte | 2ntnd+p | 

pon | antnd | nt | 3ntnd+p | 

[eee 2 | SS Se net 

Pv 3 | WA_| 32p |N/A N/A NA 
2+p (ext-to-int) 2+2p (ext-to-ext) not repeatable 

2 | 2 | 2+ | 2 || 2 | 2n | 2n+p | anep | 


Table in internal program memory: 
3 3+d 3+p 3+d+p 

Table in external program memory: 

3+p 4+d+p 3+2p 4+d+2p 


Table in internal program memory: 
2+n 2+n+nd 2+n+p 2+n+nd+p 

Table in external program memory: 

2+nt+np 2+2nt+nd 2+n+nptp 2+2n+nd 

+np +np+p 

Data source internal:t 
2+n 2+n+nd 2+n+2p 2+n+nd 

+2p 


Data source internal:t 
3 3+d 3+2p 3+d+2p 


Data source external:t 
2+ntnd 2+2n+2nd 2+n+nd 2+2n+2nd 
+2p +2p 


Data source external:t 
3+d 4+2d 3+d+2p 4+2d+2p 


Program source internal:t 
2+n 2+n+nd 2+n+2p 2+n+nd 
+2p 


Program source internal:t 
3 3+d 3+2p 3+d+2p 


Program source external:t . 
2+tnt+np 2+2ntnd 2+n+np 2+2n+nd 
+np +2p +np+2p 


Program source external:t 
3+p 4+d+p 3+3p 4+d+3p 


Zdep 


Xl 1 (minimum 1+p (minimum not repeatable 
waits for INT) waits for INT) 


tColumn headings ‘DI/DE’ refer to data destination. 
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D.2 TMS320C25 Instruction Cycle Timings 


Table D-3 lists the TMS320C25 instructions according to cycle classification. 
Table D-4 and Table D-5 show the number of cycles required for a given 
TMS320C25 instruction to execute in a given memory configuration when 
executed as a single instruction or in the repeat mode, respectively. 


Table D-3. TMS320C25 Instructions by Cycle Class 


INSTRUCTION 


ADDC ADDH ADDS ADDT AND BIT BITT DMOV LAC 
LACT LPH LT LTA LTD LTP LTS MPY MPYA MPYS 
MPYU PSHD' OR RPT SQRA SQRS- SUB SUBB SUBC SUBH 
SUBS SUBT XOR ZALH ZALR ZALS (RPT not repeatable) 


LAR LDP LST LST1 
POPD SACH SACL SAR SPH SPL SST SST1 


ADDK ADRK APAC CMPL CMPR CNFD- CNFP __ DINT EINT 
FORT LACK LARK LARP  LDPK MAR MPYK NEG NOP NORM 
POP PUSH RC RFSM RHM ROL ROR ROVM  RPTK 
RTC RTXM RXF SBRK SC SFL SFR SFSM SHM 
SOVM SPAC SPM SSXM_ STC STXM SUBK_ SXF ZAC 
(ADDK, ADRK, LACK, LARK, LDPK, MPYK, RPTK, SBRK, SPM, SUBK, and ZAC 
not repeatable) 


MACD 


BANZ BBNZ  BBZ BC BGEZ BGZ BIOZ BLEZ BLZ 
BNV BNZ BV BZ CALL (All not repeatable) 


TBLW (Table in ROM not applicable) 
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Table D-4. Cycle Timings for Cycle Classes When Not in Repeat Mode 


CLASS PI/DI PI/DE PE/DI PE/DE PR/DI PR/DE 
(Ree ae = 


Table in on-chip RAM: 
3 4+d 5+d+2p 
Table in on-chip ROM: 
4 5+d 5+d+2p 
Table in external memory: 
4+p 5+d+p 5+d+3p 
True Conditions: 
Destination on-chip RAM: 
2 2 
Destination on-chip ROM: 
3 3 
Destination external memory: 
3+p 3+p 
False Condition: 
Destination anywhere: 
2 2 
Destination on-chip RAM: 
2 2 
Destination on-chip ROM: 
3 3 
Destination external memory: 
3+p 3+p 3+2p 


Table in on-chip RAM: 
2 2+d 
Table in on-chip ROM: 
3 3+d 
Table in external memory: 
3+p 3+d+p 
Table in on-chip RAM: 
2 3+d 
Table in on-chip ROM: 
not applicable 


Table in external memory: 
2+p 3+d+p 3+2p 4+d+2p 
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Table D-4. -Cycle Timings for Cycle Classes When Not in Repeat Mode 
(Concluded) 


CLASS| _ PI/D! PI/DE PE/DI PE/DE PR/DE 


XII | Source data in on-chip RAM: 
3 3+d 3+2p 3+d+2p 
Source data in external memory: 
4+d 4+2d 4+d+2p 4+2d+2p 


(Interrupt) destination on-chip ROM 
3 (minimum waits for INT) 
(Interrupt) destination external memory 
3+2p (minimum waits for INT) 


XIV Table in on-chip RAM: 
3 3+d 4+d+2p 
Table in on-chip ROM: 
4 4+d 4+d+2p 
Table in external memory: 
4+p 4+d+p 4+d+3p 
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Table D-5. Cycle Timings for Cycle Classes When in Repeat Mode 


pn] aneng | np | anenctp | nd 
Pn nend [np | tenendtp | nnn 
ew ie eee ee ee 


Vi Table in on-chip RAM: ' 
2+n 2+2n+nd 3+n+2p 3+2n+nd+2p 3+n 3+2n+nd 


ASS 
Vv 
Vv 


Ee 
pee 
P| 
Lees 
as 


Table in on-chip ROM: 

3+n 3+2n+nd 3+n+2p 3+2n+nd+2p 3+n 3+2n+nd 
Table in external memory: : 
3+n+np 3+2ntnd+np 3+ntnp+2p 3+2n+nd+np 3+n+np 3+2n+nd+np 


+2p 


Vill not repeatable 


+ni 
+ni 


X Table in on-chip RAM: 


1+n 1+n+nd 2+nt+p 2+n+ndt+p 2+n 2+n+nd 
Table in on-chip ROM: 

2+n 2+n+nd 3+n+p 3+n+ndt+p 3+n 3+n+nd 
Table in external memory: 
2+n+np 1+2n+ndtnp 3+nt+nptp 2+2n+nd+np 3+n+np 2+2n+ndt+np 


+p 


Table in on-chip RAM: 
1+n 2+n+nd 2+n+p 3+n+ndt+p 2+n 3t+n+nd 


Table in on-chip ROM: 


not applicable 


Table in external memory: 
1+n+np 1+2n+nd+np 


Source data in on-chip RAM: 


2+nt+nptp 2+2n+nd+nptp 2+n+np- 2+2n+nd+np 


2t+n 2+n+nd 2+n+2p 2+n+nd+2p . 2+n 2+n+nd 
Source data in external memory: 
3+n+nd 2+2n+2nd 3+nt+nd+2p 2+2n+2nd+2p 3+n+nd 2+2n+2nd 


Table in on-chip RAM: 

2+n 2+n+nd 3+n+2p 3+n+nd+2p 3+n 3+n+nd 
Table in on-chip ROM: 

3+n 3+n+nd 3+n+2p 3+n+nd+2p 3+n 3+n+nd 
Table in external memory: 
3+n+np 2+2n+ndt+np 3+n+npt+2p 2+2n+ndt+np 3+n+np 2+2n+nd+np 


+2p 
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E. Development Support/Part Order Information 


This section provides development support information, device part numbers, 
and support tool ordering information for all TMS320C2x (second-generation 
TMS320) products. Figure E-1 shows the software and hardware develop- 
ment tools available for the TMS320C2x, including the development environ- 
ment when using the C compiler (see Section E.1.4). 


ASSEMBLER 
c SOURCE 
COMPILER CODE DFDP 


Ea 2 ?. 
j | | 


| 9. ASSEMBLER 
LINKER 


SIMULATOR ~ swos 


Figure E-1. TMS320C2x Development Tools 


TARGET = 
H/W 
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Extensive documentation, including application reports, user’s guides, and 
textbooks, is available to support DSP design, research, and education. To 
order TMS320 literature, contact the Tl Customer Response Center (CRC) 
hotline number, 1-800-232-3200. For more information about support pro- 
ducts and documentation, refer to the 7MS320 Family Development Support 
Reference Guide. 


The nearest TI field sales office can be contacted for support too! availability 
or further details (see list of sales offices and distributors at end of book). For 
technical support, contact the TMS320 DSP hotline, (713) 274-2320. 


The major topics discussed in this section are listed below. 


t Development Support (Section E.1 on page E-3) 

TMS320C2x Macro Assembler/Linker 

- TMS320C2x Simulator 

- TMS320C2x SoftWare Development System (SWDS) 

- TMS320C25 C Compiler 

- TMS320C2x Emulator (XDS/22) 

- TMS320C2x XDS/22 Upgrade 

—- TMS320 Analog Interface Board 

-  TMS320 Design Kit 

= Digital Filter Design Package (DFDP) 

- DSP Software Library 

- TMS320 DSP Hotline/Bulletin Board Service 


6 Part Order Information (Section E.2 on page E-11) 
= Device part numbers 
= Software and hardware support tools part numbers 
= Device and support tool prefix designators 
oor Device and support tool nomenclature 
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E.1 Second-Generation TMS320 Development Support 


Texas Instruments offers extensive development support and complete doc- 
umentation with the second-generation TMS320 digital signal processors. 
Tools are provided to evaluate the performance of the processors, develop al- 
gorithm implementations, and fully integrate the design’s software and hard- 
ware modules. Development operations are performed with the TMS320C2x 
Macro Assembler/Linker, Simulator, SoftWare Development System (SWDS), 
C Compiler, Emulator (XDS), and other support products. 


A description and key features for each TMS320C2x development support 
tool is provided in the following subsections. For more information about 
support products, refer to the 7VS320 Family Development Support Refer- 
ence Guide. For ordering information, see Section E.2. 


E.1.1 TMS320C2x Macro Assembler/Linker 


The TMS320C2x Macro Assembler translates TMS320C2x assembly language 
source code into executable object code. The assembler allows the program- 
mer to work with mnemonics rather than hexadecimal machine instructions 
and to reference memory locations with symbolic addresses. The macro as- 
sembler supports macro calls and definitions along with conditional assembly. 


The TMS320C2x Linker permits a program to be designed and implemented 
in separate modules that will later be linked together to form the complete 
program. The linker resolves external definitions and references for relocatable 
code, creating an object file that can be executed by the TMS320C2x Simu- 
lator, Emulator, or DSP device. The output of the linker can be downloaded 
into the simulator, XDS, SWDS, or PROM programmer. 


The following key features distinguish the TMS320C2x Macro 
Assembler/Linker: 


Macro capabilities and library functions 
Conditional assembly 

Relocatable modules 

Complete error diagnostics 

e Symbol table and cross reference. 


The macro assembier/linker is currently available for the VAX/VMS and 
MS/PC-DOS operating systems. 
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E.1.2 TMS320C2x Simulator 


The TMS320C2x Simulator is a software program that simulates operation of 
the TMS320C2x to allow program verification. In the debug mode, the state 
of the simulated TMS320C2x can be monitored while the program is execut- 
ing. The simulator uses the object code produced by the TMS320C2x Macro 
Assembler/Linker. During program execution, the internal registers and mem- 
ory of the simulated device are modified as each instruction is interpreted by 
the host computer. Once program execution is suspended, the internal reg- 
isters and both program and data memories can be inspected and/or modified. 
In addition, files can be associated with the I/O ports. 


The following features highlight simulator capability for effective TMS320C2x 
software development: ; 


Program debug/Vverification 

Single-step option 

Trace/breakpoint capabilities 

Full access to simulated registers and memories 
1/O device simulation. 


The simulator is currently available for the VAX/VMS and MS/PC-DOS oper- 
ating systems. 


E.1.3 TMS320C2x SoftWare Development System (SWDS) 
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The SoftWare Development System (SWDS) is a PC-resident too! that allows 
software simulation in realtime for the TMS320C2x. The SWDS provides the 
system interface necessary to write, assemble/link, load, and debug the 
TMS320C2x code on a PC workstation. The SWDS is capable of single- 
stepping through the code or setting software breakpoints for monitoring 
register or memory contents during execution. It can also associate files with 
1/O ports so that specific 1/O values may be used during test and debug. 


The SWDS consists of three parts: 


1) A circuit board, resident in the PC, that contains the TMS320C2x and 
program and data memory. 

2) Two small cable adaptor boards situated outside the PC and connected 
to the SWDS via two 40-conductor ribbon cables. The cable adaptor 
boards included with the system are: 

a) The PGA Adaptor Connector that connects the SWDS to a 
TMS320C2x target system via a 68-pin grid array footprint. 

b) The Analog Interface Board (AIB) Adaptor Connector that con- 
nects the SWDS directly to the TMS320 AIB. 

3) Software that includes TMS320C2x assembler/linker software, the DSP 
Software Library (see Section E.1.10), and SWDS monitor software. 


The SWDS is designed to function in the IBM-PC/AT and compatible envi- 
ronment, as well as in any Tl PC environment (including the Tl Business Pro). 
MS-DOS version 2.0 or later is required. 
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The development system occupies 64K bytes of PC memory. It is equipped 
with 24K words (48 kbytes) of static RAM, and allows the TMS320C2x to 
execute at full speed. Note that the SWDS does not address target memory. 


The SWDS is configured to emulate the TMS320C25 upon shipment; i.e., a 
TMS320C25 and a 40-MHz oscillator are on-board. A TMS32020 and a 
20-MHz crystal are included with the system to accommodate TMS32020 
emulation. The target system may supply a TTL clock source, in which case 
the upper limit on the clock speed is dictated by the speed of the processor 
on the PC board. If the user’s target system has no provision for a clock 
source, the external clock is specified in the debug monitor initialization com- 
mand and an external crystal is connected to the SWDS. 


E.1.4 TMS320C25 C Compiler 


A full Kernigan and Ritchie C compiler is provided for the TMS320C25. The 
compiler accepts a digital signal processing program written in the widely 
used C language and outputs TMS320C2x assembly language source code. 
The TMS320C2x mnemonics are then converted to object code by a 
TMS320C2x assembler. 


This high-level language compiler allows time-critical routines written in as- 
sembly language to be called from within the C program. The converse is also 
available; assembly routines may call C functions. The output of the compiler 
can be edited prior to assembly/link to further optimize the performance of the 
code. The compiler is also capable of accepting other programs written in C. 
Refer back to Figure E-1 for a diagram of the development environment when 
using the C compiler. 


Included with the shipment of the TMS320C25 C compiler is an enhanced 
assembler/linker. The output of this assembler/linker can be downloaded and 
used with any of the existing tools (simulator, XDS, SWDS, or PROM pro- 
grammer). 


The compiler is currently available for the VAX/VMS and MS/PC-DOS oper- 
ating systems. 


E.1.56 TMS320C2x Emulator (XDS) 


The TMS320C2x Emulator (XDS/22) is a user-friendly system that has all the 
features necessary for realtime in-circuit emulation. This allows integration of 
hardware and software modules in the debug mode. By setting breakpoints 
based on internal conditions or external events, execution of the program can 
be suspended and control given to the debug mode. In the debug mode, all 
registers and memory locations can be inspected and modified. Single-step 
execution is available. Full-trace capabilities at full speed and a reverse as- 
sembler that translates machine code back into assembly instructions also in- 
crease debugging productivity. Using a standard RS-232-C port, the object 
file produced by the TMS320C2x Macro Assembler/Linker can be down- 
loaded into the emulator, which then can be controlled through a terminal. 


Two 4K x 16-word banks of high-speed static RAM can be mapped into a 
fixed address space starting at 0 for both program and data memory. Also 
available are 64K words of dynamic RAM, which can be mapped into the us- 
er’s program and data address spaces. The XDS is capable of executing out 
of target memory to utilize the full TMS320C2x program/data address range. 
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For multiprocessing configurations, up to nine emulators can be daisy-chained 
together. 


The XDS/22 emulator is a completely self-contained system with power sup- 
ply. With three RS-232-C ports, the XDS/22 Emulator can be interfaced to a 
terminal, host computer for source or object downloading/uploading capabil- 
ities, and printer or PROM programmer. 


The key features of the TMS320C2x XDS/22 Emulator are as follows: 


40-MHz full-speed in-circuit emulation 

Supports all second-generation TMS320 family members 
PLCC target connector with pin grid array (PGA) adaptor 

4K words each of program and data memory (zero wait states) 
64K-word PROM memory expansion board (wait states) 
Breakpoint, trace, and timing (BTT) capabilities 

Single-step execution 

Line-by-line assembler/reverse assembler 

Enhanced decimal parameter entry and display 

Use of target system CLKIN signal or internal crystal 


Host-independent upload/download capabilities to/from program or 
data memory 


Ability to inspect and modify registers and program/data memory 
Supports multiprocessor configurations 
Logic tracing with extended data/address logic analyzer interface. 


Figure E-2 shows a block diagram of a typical system configuration using the 
TMS320C2x XDS/22 Emulator. 
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Figure E-2. TMS320C2x XDS/22 System Configuration 
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E.1.6 TMS320C2x XDS/22 Upgrade 


Texas Instruments offers a TMS320C2x XDS upgrade kit, which can extend 
the functionality of existing development systems at a minimum of cost 
through an enhancement of current customer equipment. For example, the 
upgrade kit can enable a TMS32020 XDS/22 to emulate operation of both 
second-generation devices. Upgrade kits allow upgrade only within a gener- 
ation, not from a first- to a second-generation XDS. 


The TMS320C2x XDS upgrade kit consists of the following contents: 


) Firmware (2 PALs, 2 EPROMs) 
TMS320C25 and crystal 

4K x 16 high-speed static RAM (2 sets) 
40-MHz breakpoint, trace, and timing board 
PGA/PLCC target connector. 


E.1.7 TMS320 Analog Interface Board 


The TMS320 Analog Interface Board (AIB) is an analog-to-digital, digital- 
to-analog conversion board used as a preliminary target system with either the 
TMS320C2x SWDS, XDS, or another emulator (see Figure E-3). The AIB is 
an educational tool that provides a simple, inexpensive way to become familiar 
with digital signal processing (DSP) techniques. 


The AIB allows testing of application programs with analog !/O by providing 
an interface to the TMS320C2x. The AIB provides 12-bit A/D and D/A con- 
verters with expansion ports for additional A/D and D/A converters. Key 
features of the AIB are as follows: 


e 12-bit analog-to-digital converter with sample and hold 

12-bit digital-to-analog converter 

One 16-bit output port for additional D/A or user-defined application 
One 16-bit input port for additional A/D or user-defined application 
Two lowpass filters; an audio amplifier 

@ Prototyping area for user applications. 


The AIB runs at full speed up to 20 MHz for TMS320 family members. The 
AIB sample rate clock, derived from the TMS320 CLKOUT signal, may be 
programmed to provide periodic analog input, output, or both. There are two 
analog lowpass filters on the AIB. One filter on the A/D input band-limits the 
input to minimize aliasing effects. The other filter smooths the output of the 
D/A. Filter frequency response is controlled by varying the external compo- 
nents in the filter stages. Filter cutoff is set to 4.7 kHz, but may be (plug) 
programmed. An audio ampiitier that drives an 8-ohm speaker is provided for 
applications with audio output. 


An AIB adaptor board is required to convert the 40-pin dual-inline socket for 
the TMS320C1x to accommodate the 68-pin grid array package of the 
TMS32020. An additiona! adaptor socket is necessary for TMS320C25 oper- 
ation. Contact the nearest Ti Sales Office for a list of commercially available 
adaptor socket vendors. 
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POWER 
SUPPLY 


POWER 


_OR CABLE 
ANALOG 
USER'S TMS320C2x TMS320 OuT 
TERMINAL XDS AIB sukbG 
IN 


Figure E-3. TMS320 AIB System Configuration 


E.1.8 TMS320 Design Kit 


The TMS320 DSP Design Kit has been created by Texas Instruments to aid 
the user in becoming familiar with the TMS320 family of digital signal pro- 
cessors, thus accelerating the evaluation of these devices. The kit contains the 
following: 


® Samples: one TMS32020GBL, one TMS32010NL, one Codec 
(TCM2916), and four preprogrammed PROMs (TBP38L165-45). 


e ADPCM Design Example using the TMS32010. 
C FFT Design Example using the TMS32020. 


® Digital Signal Processing Applications with the TMS320 Family, a 
comprehensive 750-page book filled with TMS320 applications. 

© Digital Signal Processing Software Library, containing source code for 
most of the DSP applications discussed in the Applications Book as well 
as other valuable routines. 


e TMS320C1x and TMS320C2x User’s Guides. 


° Latest copy of the TMS320 quarterly newsletter, Details on Signal Pro- 
cessing. 


The Design Kit is available through jocal T! authorized distributors or directly 
from Texas Instruments. Contact the nearest Tl Sales Office for more infor- 
mation. 
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E.1.9 Digital Filter Design Package (DFDP) 


The Digital Filter Design Package (DFDP) from Atlanta Signal Processors, Inc. 
(ASPI) is a user-friendly, menu-driven software package intended to speed 
the design of digital filters with floating-point accuracy or fixed-point econ- 
omy in a variety of filter structures. The package consists of four interactive 
filter design modules capable of performing the following functions: 


1) Designing FIR filters (Kaiser window) 

2) Designing FIR filters (Parks- McClellan) 

3) Designing IIR filters (Butterworth, Chebychev | and II, and elliptic) 

4) Generating TMS320C1ix or TMS320C2x assembly code by converting 
the ASCII file containing the filter coefficients into fully commented as- 
sembly language code for those devices. 


Cascade and parallel structures as well as higher-performance lattice, normal- 
ized lattice, and orthogonal forms are included in the modules. 


The DFDP can design filters to meet any piecewise linear response specifica- 
tion, evaluate filter characteristics before and after coefficient quantization, 
and design special-purpose FIR filters, such as multiband filters, differentia- 
tors, Hilbert transformers, and raised-cosine filters. The DFDP can also gen- 
erate coefficients for filter implementations on any general-purpose processor 
or signal processing chip, as well as fully commented assembly language code 
for a variety of DSP chips. Magnitude, log magnitude, and impulse responses 
can be plotted for printer or screen display; in addition, the phase, group delay, 
and pole-zero map can be plotted for IIR filters. After the filter is designed, 
the user can generate code associated with the filter using the CGEN design 
module. 


The DFDP runs on the IBM PS/2, IBM PC/XT/AT, and compatible systems. 
Operating systems must have 192 kbytes of memory available. For more in- 
formation, contact Atlanta Signal Processors, Inc. (404-892-7265) or the 
nearest TI field sales office. 


E.1.10 DSP Software Library 


The Digital Signal Processing Software Library contains the major DSP rou- 
tines (FFT, FIR/IR filtering, and floating-point operations) and application 
algorithms (echo cancellation, ADPCM, and DTMF coding/decoding) pre- 
sented in the book, Digital Signal Processing Applications with the TMS320 
Family. These routines and algorithms are written in either TMS320C1x 
and/or TMS320C2x source code. In addition, macros for the TMS320C1x are 
included in the library. 


The software package consists of four diskettes for use with the TI/IBM 
MS/PC-DOS (version 1.1 or later) or a 1600 BPI magnetic tape for the 
VAX/VMS version. All the directories on the MS/PC-DOS version are con- 
tained on the magnetic tape for the VMS version. Each directory contains a 
README.LIS file briefly describing the contents of the files in the directory 
and the reference to the code. The book, Digital Signal Processing Applica- 
tions with the TMS320 Family, is the major reference for the theory and al- 
gorithms, and also provides printed code in the appendices of each application 
report. 
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The software library and applications book are included in the purchase of a 
TMS320 Design Kit (see Section E.1.7). The library can also be ordered sep- 
arately through Tl (see Table E-2 for ordering information). 


All the software in the library is copyrighted by Texas Instruments. The library 
is continually being updated; therefore, check the TMS320 DSP Bulletin 
Board (713-274-2323) for update information. 


E.1.11 TMS320 DSP Hotline/Bulletin Board Service 


The TMS320 group at Texas Instruments provides a DSP Hotline to answer 
TMS320 technical questions such as device problems, development tools, 
documentation, upgrades, and new TMS320 products. The hotline is open 
five days a week from 8:00 AM to 4:30 PM Central Time. The phone number 
is (713) 274-2320. For pricing and availability of TMS320 devices and de- 
velopment tools, contact the nearest TI sales office. To order literature, call the 
Customer Response Center (CRC) at (800) 232-3200. 


The TMS320 DSP Bulletin Board Service is a telephone-line computer bulletin 
board that provides access to information pertaining to TMS320 devices. 


- Specification updates for current or new TMS320 devices and development 


tools are communicated via the bulletin board as the information becomes 
available. The Bulletin Board Service can be accessed by dialing (713) 
274-2323 with a 1200-bps modem. 


The bulletin board contains TMS320 source code from the application reports 
included in the book, Digital Signal Processing Applications with the TMS320 
Family. The bulletin board also provides new DSP application software as it 
becomes available. See the 7WS320 Family Development Support Reference 
Guide for information on how to access the bulletin board. 
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E.2 Part Order Information 


This section provides the device and support tool part numbers. Table E-1 
lists the part numbers for all the second-generation members of the TMS320 
family. Table E-2 gives ordering information for TMS320C2x hardware and 
software support tools. Table E-3 provides a list and description of the de- 
velopment tool connections to a target system. A discussion of the TMS320 
family device and development support tool prefix and suffix designators is 
included to assist in understanding the TMS320 product numbering system. 


Table E-1. TMS320C2x Digital Signal Processor Part Numbers 


se ase | SOR | TET 
DEVICE TECHNOLOGY FREQUENCY TYPE DISSIPATION 


tMilitary version available. 
tMilitary version planned; contact nearest sales office for availability. 


Table E-2. TMS320C2x Support Tool Part Numbers 


TOOL DESCRIPTION OPERATING SYSTEM| PART NUMBER 
SOFTWARE 


Macro Assembler/Linker VAX VMS TMDS3242210-08 
MS/PC-DOS TMDS3242810-02 
Simulator VAX VMS TMDS3242211-08 
MS/PC-DOS TMDS3242811 -02 

SoftWare Development System MS/PC-DOS TMDS3268821 


C Compiler (TMS320C25) VAX VMS TMDS3242255-08 
MS/PC-DOS TMDS3242855-02 

DSP Software Library VAX VMS TMDC3240212-18 | 
MS/PC-DOS TMDC3240812-12 


TMDS3262221 
Customer Upgradet 


TMS320 Design Kit 


tSee Table E-3 for a list of connections to a target system. 
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Table E-3. Development Tool Connections to a Target System 


TOOL TARGET CONN. | INCL. PART NUMBER 


TMS320C25 XDS/22 | 68-pin PGA/PLCC X 
68-pin PGA/PLCC X TMDX3288825 
68-pin PGA X TMDX3288820 

TMS320C25 XDS/22 | 68-pin PGA/PLCC X 

Upgrade 68-pin PGA/PLCC X TMDX3288825 
68-pin PGA X TMDX3288820 


E.2.1 Device and Development Support Tool Prefix Designators 


To assist the user in understanding the stages in the product development 
cycle, Texas Instruments assigns prefix designators in the part number no- 
menclature. A device prefix designator has three options: TMX, TMP, and 
TMS, and a development support tool prefix designator has two options: 


_ TMDX and TMDS. These prefixes are representative of the evolutionary stages 


of product development from engineering prototypes (TMX/TMDX) through 
fully qualified production devices (TMS/TMDS). This development flow is 
defined below. 


Device Development Evolutionary Flow: 


TMX _ Experimental device that is not necessarily representative of the final 
device's electrical specifications. 


TMP Final silicon die that conforms to the device’s electrical specifications 
but has not completed quality and reliability verification. 


TMS Fully qualified production device. 


Support Tool Development Evolutionary Flow: 


TMDX Development support product that has not yet completed Texas In- 
struments internal qualification testing. 


TMDS_ Fully qualified development support product. 


TMX and TMP devices and TMDX development support tools are shipped 
against the following disclaimer: 


“Developmental product is intended for internal evaluation purposes.” 


Note: 


Texas Instruments recommends that prototype devices (TMX or TMP) not 


be used in production systems since their expected end-use failure rate is 
undefined but predicted to be greater than standard qualified production 
devices. 
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TMS devices and TMDS development support tools have been fully charac- 
terized and the quality and reliability of the device has been fully demon- 
strated. Texas Instruments standard warranty applies. 


E.2.2 Device and Development Support Too! Nomenclature 


In addition to the prefix, the device nomenclature includes a suffix that follows 
the device family name. This suffix indicates the package type (e.g., N, FN, 
or GB) and temperature range (e.g., L). Figure E-4 provides a legend for 
reading the complete device name for any TMS320 family member. 


TMS 320 C 25 GB L 


PREFIX eae 


L TEMPERATURE RANGE 


TMX = experimental device 
TMP = prototype device 
TMS = qualified device 
SMJ = MIL-STD-883C 


DEVICE FAMILY 
320 = TMS320 family 


TECHNOLOGY 
C = CMOS 

E = CMOS EPROM 
No letter = NMOS 


DEVICE 
1st-gen. DSP: 
10 
11 
15 
17 
2nd-gen. DSP: 
20 


25 
3rd-gen. DSP: 
30 


L= Oto 70°C 

S = -55 to 100°C 
= -55 to 125°C 

A = -40 to 85°C 


PACKAGE TYPE 

N = plastic DIP 

JD = ceramic DIP 
side-brazed 

FN = plastic leaded CC 

GB = ceramic PGA 

FJ = ceramic leaded CC 

FD = leadless ceramic CC 


Figure E-4. TMS320 Device Nomenclature 
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Figure E-5 provides a legend for reading the part number for any TMS320 
hardware or software development tool. 


TMDS 32 4 28 10-0 2 


QUALIFICATION Steel L MEDIUMT 
TMDX = prototype 2 = 5 1/4” floppy disk 
TMDS = qualified 8 = 1600 BPI magnetic tape 
DEVICE FAMILY S/W FORMATtT 
32 = TMS320 family O = object code 

1 = source code 
PRODUCT TYPE 
4 = software SEQUENCE NUMBER# 
6 = hardware 
8 = upgrade 

GENERATION 

1 = 1st gen. 

2 = 2nd gen. 

3 = 3rd gen. 
SOFTWARE OPERATING SYSTEM or HARDWARE MODEL 
02 = 1st-gen. VAX VMS 11 = XDS/11 
08 = 1st-gen. TI/IBM MS/PC-DOS 22 = XDS/22° 
22 = 2nd-gen. VAX VMS 88 = target connector 


28 = 2nd-gen. TI/IBM MS/PC-DOS 
32 = 3rd-gen. VAX VMS 
38 = 3rd-gen. TI/IBM MS/PC-DOS 


t Software only. 
+ Hardware only. 


Figure E-5. TMS320 Development Tool Nomenclature 


F. Memories, Analog Converters, Sockets, and Crystals 


This appendix provides product information regarding memories, analog con- 
verters, and sockets, which are manufactured by Texas Instruments and com- 
patible with the TMS320C2x. Information is also given regardirig crystal 
frequencies, specifications, and vendors. 


The contents of the major areas in this appendix are listed below. 


Tl Memories and Analog Converters (Section F.1 on page F-2) 
- EPROM memories 

a Codecs and filters 

- Analog interface circuits 

- A/D and D/A converters. 


Tl Sockets for PGA and PLCC Packages (Section F.2 on page F-28) 
- Production sockets 
- Burn-in/test sockets. 


Crystals (Section F.3 on page F-33) 

= Commonly used crystal frequencies 
- Crystal specification requirements 
= Vendors of suitable crystals. 
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F.1 Tl Memories and Analog Converters 


F-2 


This section provides pages of product information taken from data sheets for 
EPROM memories, codecs, analog interface circuits, and D/A and D/A con- 
verters. 


All of these devices can be interfaced with TMS320C2x processors (see Sec- 
tion 6 for hardware interface designs). Refer to Digital Signal Processing Ap- 
plications with the TMS320 Family for additional information on interfaces 
using memories and analog conversion devices. 


The following paragraphs give the name of each device and where the data 
sheet for that device is located in order to obtain further specification infor- 
mation if desired. 


Data sheets for EPROM memories are located in the MOS Memory Data Book 
(SMYDOO6). The name of the device and the page number in the book on 
which the device is introduced are listed. 


TMS27C64 (page 6-21) 
TMS27C128 (page 6-29) 
TMS27C256 (page 6-37) 
TMX27C512 (page 6-45) 


Another EPROM memory, TMS27C291/292, is described in a data sheet 
(SMLS291A). 


The TCM29C13/14/16/17 codecs and filters are described in the data sheet 
beginning on page 2-111 of the 7elecommunications Circuits Data Book 
(SCT001). An analog interface for the DSP using a codec and filter is pro- 
vided by the TCM29C18/19 (data sheet number SCTQ21). 


The data sheet for the TLC32040 analog interface circuit is provided in the 
Interface Circuits Data Book, beginning on page 2-271. 


In the same book are data sheets for A/D and D/A converters. The name of 
the device and the page on which it is introduced are as follows: 


TLC0820 (page 2-113) 
TLC1205/1225 (page 2-181) 
TLC7524 (page 2-243) 


ADVANCE | TMS27C64 
INFORMATION 65,536-BIT ERASABLE PROGRAMMABLE READ-ONLY MEMORY 


SEPTEMBER 1985 


Organization... 8K x 8 J PACKAGE 
(TOP VIEW) 
Single 5-V Power Supply 


Pin Compatible with Existing 64K EPROMs 
and TMS2732A 


@ All Inputs/Outputs Fully TTL Compatible 


@ Max Access/Min Cycle Time 
‘'27C64-1, ‘'27C64-15 150ns 
'27C64-2, ‘'27C64-20 200ns 
'27C64, '27C64-25 250ns 
'27C64-3, ‘27C64-30 300ns 
'27C64-4, ‘27C64-45 450ns 


HVCMOS Technology 
3-State Output Buffers 


400 mV Guaranteed DC Noise Immunity 
with Standard TTL Loads 


@ Low Power Dissipation (Vcc = 5.25 V) 
—Active .... 158 mW Worst Case Address Inputs 
—Standby ... 1.4 mW Worst Case Chip Enable/Power Down 

(CMOS-Input Levels) Output Enable 
Ground 
No Connection 


Program 


Outputs 
5-V Power Supply 
12.5-V Power Supply 


description 


The TMS27C64 series are 65,536-bit, ultraviolet-light erasable, electrically programmable read-only 
memories. These devices are fabricated using HVCMOS technology for high speed and simple interface 
with MOS and bipolar circuits. All inputs (including program data inputs) can be driven by Series 74 TTL 
circuits without the use of external pull-up resistors, and each output can drive one Series 74 TTL circuit 
without external resistors. The data outputs are three-state for connecting multiple devices to acommon 
bus. The TMS27C64 is pin compatible with existing 28-pin ROMs and EPROMs. It is offered in a dual-in- 
line ceramic package (J suffix) rated for operation from O°C to 70°C. 


Since these EPROMs operate from a single 5-V supply (in the read mode), they are ideal for use in 
microprocessor-based systems. One other (12.5 V) supply is needed for programming, but all programming 
signals are TTL level. For programming outside the system, existing EPROM programmers can be used. 
Locations may be programmed singly, in blocks, or at random. 


operation 


There are seven modes of operation for the TMS27C64 listed on the following page. Read mode requires 
a single 5-V supply. All inputs are TTL level except for Vpp during programming (12.5 V) and 12 V on 
AQ for signature mode. 


information on new products in the sampling or 
preproduction phase of development. Characteristic 
data and other specifications are subject to change 
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wlthost notice INSTRUMENTS 
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ADVANCE TMS27€128 
INFORMATION 131,072-BIT ERASABLE PROGRAMMABLE READ-ONLY MEMORY 


OCTOBER 1984—REVISED SEPTEMBER 1985 


Organization ... 16K x8 J PACKAGE 


(TOP VIEW) 
Single 5-V Power Supply 


Pin Compatible with Existing 64K and 
128K EPROMs 


@ All Inputs/Outputs Fully TTL Compatible 


Max Access/Min Cycle Time 

°270128-1, ‘'27C128-15 150 ns 
‘'27C0128-2, ‘'270128-20 200ns 
°27C128, °27C128-25 250 ns 
°27C128-3, ‘'27C128-30 300 ns 
°'270128-4, ‘'27C128-45 450 ns 


HVCMOS Technology 
3-State Output Buffers 


@ 400 mV Guaranteed DC Noise Immunity 
with Standard TTL Loads 


@ Low Power Dissipation (Vcc =5.25 V) 2 Address Inputs 
—Active ... 158 mW Worst Case E Chip Enable/Power Down 
—Standby ... 1.4 mW Worst Case Output Enable 


({CMOS-Input Levels) Ground 


Program 

Outputs 

5-V Power Supply 
12.5-V Power Supply 


description 


The TMS27C128 series are 131,072-bit, ultraviolet-light erasable, electrically programmable read-only 
memories. These devices are fabricated using HVCMOS technology for high speed and simple interface 
with MOS and bipolar circuits. All inputs (including program data inputs) can be driven by Series 74 TTL 
circuits without the use of external pull-up resistors, and each output can drive one Series 74 TTL circuit 
without external resistors. The data outputs are three state for connecting multiple devices to a common 
bus. The TMS27C1 28 is pin compatible with existing 28-pin ROMs and EPROMs. It is offered in a dual-in- 
line ceramic package (J suffix) rated for operation from O°C to 70°C. 


Since these EPROMs operate from a single 5-V supply (in the read mode), they are ideal for use in 
microprocessor-based systems. One other (12.5 V) supply is needed for programming, but all programming 
signals are TTL level. For programming outside the system, existing EPROM programmers can be used. 
Locations may be programmed singly, in blocks, or at random. 


Operation 


There are seven modes of operation for the TMS27C 128 listed on the following page. Read mode requires 
a single 5-V supply. All inputs are TTL level except for Vpp during programming (12.5 V) and 12 V on 
AQ for signature mode. 


Hires pleat sar Ly saprm Sage wy Copyright © 1985, Texas Instruments Incorporated 
information on new ‘products in the sampling or 
Peterlee mes Texas 
and ot! NS are si 0 chan 
wiletatke, . INSTRUMENTS 


POST OFFICE BOX 1443 ® HOUSTON, TEXAS 77001 


TMS27€256 
262,144-BIT ERASABLE PROGRAMMABLE READ-ONLY MEMORY 


SEPTEMBER 1984 — REVISED NOVEMBER 1985 


Organization ... 32K x 8 J PACKAGE 
(TOP VIEW) 


Single 5-V Power Supply 


Pin Compatible with Existing 128K and 
256K EPROMs 


All Inputs/Outputs Fully TTL Compatible 


Max Access/Min Cycle Time 

°27C256-1, ‘'27C256-17. 170 ns 
°27C256-2, ‘'27C256-20 200 ns 
°27C256, (‘'27C256-25 250 ns 
°27C256-3, ‘'27C256-30 300ns 
°27C256-4, ‘'27C256-45 450 ns 


HVCMOS Technology 
3-State Output Buffers 


400 mV Guaranteed DC Noise !mmunity 
with Standard TTL Loads 


@ Low Power Dissipation (Vcc = 5.25 V) 
—Active . .. 210 mW Worst Case Address Inputs 
—Standby ... 1.4 mW Worst Case Chip Enable/Power Down 


(CMOS-Input Levels) Output Enable 
Ground 


Outputs 
5-V Power Supply 
12.5-V Power Supply 


description 


The TMS27C256 series are 262,144-bit, ultraviolet-light erasable, electrically programmable read-only 
memories. These devices are fabricated using HVCMOS technology for high speed and simple interface 
with MOS and bipolar circuits. All inputs (including program data inputs) can be driven by Series 74 TTL 
circuits without the use of external pull-up resistors, and each output can drive one Series 74 TTL circuit 
without external! resistors. The data outputs are three state for connecting multiple devices to acommon 
bus. The TMS27C256 is pin compatible with existing 28-pin ROMs and EPROMs. It is offered in a dual-in- 
line ceramic package (J suffix) rated for operation from O°C to 70°C. 


Since these EPROMs operate from a single 5-V supply (in the read mode), they are ideal for use in 
microprocessor-based systems. One other (12.5 V) supply is needed for programming, but all programming 
signals are TTL level. For programming outside the system, existing EPROM programmers can be used. 
Locations may be programmed singly, in blocks, or at random. 


operation 


There are seven modes of operation for the TMS27C256 listed on the following page. Read mode requires 
a single 5-V supply. All inputs are TTL level except for Vpp during programming (12.5 V) and 12 V on 
AQ for signature mode. 
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ADVANCE TMS27C512 
INFORMATION 524,288-BIT ERASABLE PROGRAMMABLE READ-ONLY MEMORY 


NOVEMBER 1985 — REVISED AUGUST 1986 


Organization .. . 64K x8 J PACKAGE 
(TOP VIEW) 


Single 5-V Power Supply 


Pin Compatible with Existing NMOS 512K 
EPROMs 


All Inputs/Outputs Fully TTL Compatible 


Max Access/Min Cycle Time 

°27C512-2, ‘'27C0512-20 200ns 
°27C512, °270512-25 250 ns 
'27C512-3, ‘'27C0512-30 300ns 
'27C512-4, ‘27C512-45 450ns 


HVCMOS Technology 
3-State Output Buffers 


400 mV Guaranteed DC Noise Immunity 
with Standard TTL Loads 


Low Power Dissipation (Vcc = 5.25 V) 
—Active .. . 263 mW Worst Case 


—Standby ... 1.4 mW Worst Case Address Inputs 
(CMOS-Input Levels} Chip Enable/Power Down 
Ground 
description Outputs 
The TMS27C512 series are 524,288-bit, ey over eunpy 
ultraviolet-light erasable, electrically e2c5-y Powel Supply 
programmable read-only memories. These eutpaneneble 


devices are fabricated using HVCMOS 

technology for high speed and simple interface 

with MOS and bipolar circuits. All inputs 

(including program data inputs) can be driven by Series 74 TTL circuits without the use of external pull-up 
resistors, and each output can drive one Series 74 TTL circuit without external resistors. The data outputs 
are three state for connecting multiple devices to a common bus. The TMS27C512 is pin compatible with 
existing 28-pin ROMs and EPROMs. It is offered in a dual-in-line ceramic package (J suffix) rated for operation 
from 0°C to 70°C. 


Since these EPROMs operate from a single 5-V supply (in the read mode), they are ideal for use in 
microprocessor-based systems. One other (12.5 V) supply is needed for programming, but all programming 
signals are TTL level. For progra.arning outside the system, existing EPROM programmers can be used. 
Locations may be programmed singly, in blocks, or at random. 


operation 


F.6 


There are seven modes of operation for the TMS2.7CG 7 2 listed on the following page. Read mode requires 
a single 5-V supply. All inputs are TTL level except for Vpp during programming (12.5 V) and 12 V on 
AQ for signature mode. 
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ADVANCE | TMS270291, TMS27€292 
INFORMATION 16,384-BIT UV ERASABLE PROGRAMMABLE READ-ONLY MEMORIES 


SEPTEMBER 1986—REVISED NOVEMBER 1986 


Organization... 2K x 8 J PACKAGE 
(TOP VIEW) 


Single 5-V Power Supply 


Pin Compatible with Existing 2K x 8 
Bipolar/CMOS PROMs 


All Inputs/Outputs TTL Compatible 


High Speed 

Max Access/Min Cycle Time 
Vcoc+5% 

'27C291-3 *27C292-3 35 ns 

'27C291 '27C292 45 ns 

'27C291-5 '27C292-5 50 ns 
Vcec+10% 


'270291-45 ‘'27C292-45 45ns 
‘27€291-50 ‘27C292-50 50ns 


Low-Power CMOS Technology 


3-State Output Buffers = pations inputs 
Ground 
@ 400 mV Guaranteed DC Noise immunity ‘ Outputs 
with Standard TTL Loads $1, $2, Chip Selects 
@ Low Power Dissipation . . . 394 mW Max SM Power Supply 


Eraseable tPins 18-20 have different pin assignments and 
functions in the program mode (see page 3). 
@ 100% Pretestable 


description 


The TMS27C291 and TMS27C292 series are 16,384-bit, ultraviolet-light erasable, electrically 
programmable read-only memories. These devices are fabricated using CMOS technology for high speed 
and simple interface with MOS and bipolar circuits. All inputs (including program data inputs) can be driven 
by Series 74 TTL circuits without the use of external pull-up resistors, and each output can drive eight 
Series 74 TTL circuits without external resistors. The data outputs are three state for connecting multiple 
devices to a common bus. These devices are pin compatible with existing 24-pin PROMs and EPROMs. 
They are offered in dual-in-line ceramic packages (J suffix). The package for the TMS27C291 is designed 
for insertion in mounting-hole rows on 7,62-mm (300-mil) centers, and the package for the TMS27C292 
is designed for insertion on 15,24-mm (600-mil) centers. The TMS27C291 and TMS27C292 are guaranteed 
for operation from O°C to 70°C. 


operation 


There are eight modes of operation for the TMS27C291 and TMS27C292 listed on the following page. 
Read mode requires a single 5-V supply. All inputs are TTL or CMOS levels except for Vpp (pin 20) during 
programming (13.5 V). 


ADVANCE INFORMATION documents contain 2 Copyright © 1986, Texas instruments Incorporated 
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TMS27€291, TMS270292 
16,384-BIT UV ERASABLE PROGRAMMABLE READ-ONLY MEMORIES 


FUNCTION 
(PINS) 


S1/Vppt 
(20) 
S2/VFYt 
(19) 
S3/PGMt 
(18) 

Vcc 
(24) 


Read Output Output Output Program Program Fast Blank Check|Blank Check si 
ee Disable Disable Disable Verify Inhibit Program Ones Zeros ignature 


Pin assignment for program mode. 
+X can be Vic or VIH- 
5Programming levels for Vi_ and VjH. 


read/output disable 


When the outputs of two or more TMS27C291’s or TMS27C292’s are connected in parallel on the same 
bus, the output of any particular device in the circuit can be read with no interference from the competing . 
outputs of the other devices. To read the output of the TMS27C291 or TMS27C292, a low-level signal 
is applied to S1 and a high-level signal is applied to S2 and S3. Any other combination of logic states on 
these three inputs will disable the outputs. Output data is accessed at pins Q1 through O8. 


erasure 


Before programming, the TMS27C291 or TMS27C292 is erased by exposing the chip through the 
transparent lid to high intensity ultraviolet light (wavelength 2537 angstroms). The recommended minimum 
exposure dose (UV intensity x exposure time) is fifteen watt-seconds per square centimeter. A typical 
12 milliwatt per square centimeter, filterless UV lamp will erase the device in 21 minutes. The lamp should 
be located about 2.5 centimeters above the chip during erasure. It should be noted that normal ambient 
light contains the correct wavelength for erasure. Therefore, when using the TMS27C291 or TMS27C292, 
the window should be covered with an opaque label. 
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TEM29C13, TCM29C14, TCM29C16, TCM29C17 
COMBINED SINGLE-CHIP PCM CODEC AND FILTER 


02765, APRIL 1986 


@ Replaces Use of TCM2910A and 
TCM2911A in Tandem with TCM2912B/C 


Reliable Silicon-Gate CMOS Technology 


FEATURE TABLE 


[Feature [28013] 29074 [escv6| 29077] 


Number of Pins: 


Low Power Consumption: . 
Operating Mode. . . 80 mW Typical v6 
Power-Down Mode... 5 mW Typical Mawiinlawieeaing: 
@ Excellent Power Supply Rejection Ratio Over pelaw 
Frequency Range of O to 50 kHz A-law 
Data Timing Rates: 
@ No External Components Needed for Variable Mode 
Sample, Hold, and Auto-Zero Functions 64 kHz to 2.048 MHz 
ne: Fixed Mode 
Precision Internal Voltage References 1.536 MHz 
Direct Replacement for Intel 2913, 2914, 1.544 MHz 
2916, and 2917 2.048 MHz 
Loopback Test Capability 
@ Formerly TCM4913, TCM4914, TCM4916, 


TCM4917, Respectively 


8th-Bit Signaling 


TCM29C13 TCM29C14 TCM29C16, TCM29C17 
J DUAL-IN-LINE PACKAGE J DUAL-IN-LINE PACKAGE J DUAL-IN-LINE PACKAGE 
(TOP VIEW) (TOP VIEW) (TOP VIEW) 

Vea C1 U2} vec Ves Ut U2) vec vepO1 Orel Vcc 
pwro+{{2 191] Gsx pwro+ {2 23 ]Gsx pwro+(]2 15, ]GSx 
pwro- []3  18f] ANLG IN- pwro- (3 22 JANLG IN - Pwro-[]3 14[ J ANLG IN- 

Gsr {ls 7f] ANLG IN+ GsR(\4 —21f JANLG IN+ PON([}4 13{] ANLG GND 

PDN(Js 16] ANLG GND PON(}s —20[ JANLG GND DCLKR[]s — 12 ] TSX/DCLKX 
CLKSEL[]6 15, ]ASEL cLKSEL Cs 19 ]NC PCMIN(|s 11 PCM OUT 

DcLKRE]? 14] TSX/DCLKXx ANLG LOOP {]}7 raf] SIGX/ASEL FSR/TSRE(]7 10, JFSX/TSXE 
PcMIN[{s 13] PCM OUT sicR(s 17 ]TSX/OCLKx DGTLGND[]s__9f JCLKR/CLKX 


FSR/TSRE (}9 
DGTL GND {]10 


12[ | FSX/TSXE 
111} CLKR/CLKX PCM IN {| 
FSR/TSRE [] 


DGTL GND [] 


TCM29C14... 
(TOP VIEW) 


1 

je} 
ios 
= 
a 
— 
3 


10 
VW 
12 


\{] PWRO + 


16f] PCM OUT 
15{ ] FSX/TSXE 


Gsr\ fs 25[] ANLG IN+ 
NC fj6 24(] ANLG GND 
PDN [7 23[] NC 
CLKSEL [8 22(] SIGX/ASEL 
ANLG LooP[]9 21{] TSX/DCLKX 


SIGR [J10 
OCLKR [J11 


w 


PCM IN {J 
FSR/TSRE ] 
DGTL GND JJ 


NC —No internal connection 


y 
Aiad 


> 


204] NC 
19[] PCM OUT 


“ 


— 
1S} 
z 


FSX/TSXE [J 


Caution. These devices have limited built-in gate protection. The leads should be shorted together or the device 
placed in conductive foam during storage or handling to prevent electrostatic damage to the MOS gates. 


ADVANCE INFORMATION documents contain 
information on new products in the sampling or 
preproduction phase of development. Characteristic 
data and other specifications are subject to change 
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TOM29€13, TCM29C14, TCM29C16, TCM29C17 
COMBINED SINGLE-CHIP PCM CODEC AND FILTER 


description 


The TCM29C13, TCM29C14, TCM29C16, and TCM29C17 are single-chip pulse-code-modulated encoders 
and decoders (PCM codecs) and PCM line filters. These devices provide all the functions required to interface 
a full-duplex (4-wire) voice telephone circuit with a time-division-multiplexed (TDM) system. These devices 
are intended to replace the TCM2910A or TCM2911A in tandem with the TCM2912B/C. Primary 
applications of the devices include: 

@ Line Interface for Digital Transmission and Switching of T1 Carrier, PABX, and Central Office 

Telephone Systems 

@ Subscriber Line Concentrators 

®@ Digital Encryption Systems 

® Digital Voice Band Data Storage Systems 

@ Digital Signal Processing 


These devices are designed to perform the transmit encoding (A/D conversion) and receive decoding (D/A 
conversion) as well as the transmit and receive filtering functions in a pulse-code-modulated system. They 
are intended to be used at the analog termination of a PCM line or trunk. 


The TCM29C13, TCM29C14, TCM29C16, and TCM29C17 provide the bandpass filtering of the analog 
signals prior to encoding and after decoding. These combination devices perform the encoding and decoding 
of voice and call progress tones as well as the signaling and supervision information. 


The TCM29C13, TCM29C14, TCM29C16, and TCM29C17 are characterized for operation from O°C to 
70°C. 


functional block diagram 


TRANSMIT 
: SECTION 


me SUCCESSIVE eatouad 
ANLG IN+ 
: APPROXI- OUTPUT TSX/DCLKX 
ANLG IN— MATION REGISTER 


SIGX/ASEL 
GSX 


FSX/TSXE 
CLKX 


RECEIVE 


SECTION 
CLKSEL 


CONTROL PDN 
GSR 4 ANLG Loopt 


DIGITAL PCM IN 
TO ANALOG 
CONTROL 
LOGIC OLCKR 


PWRO+ SAMPLE 
AND HOLD 
AND DAC 


PWRO- sicrt 


Vec Veep OGTL ANLG FSR/TSRE 
GND cLKRt 
TTCM29C14 ONLY 
+TCM29C13, TCM29C16, AND TCM29C17 ONLY 


F-10 TEXAS wy 


INSTRUMENTS 


POST OFFICE BOX 225012 ® DALLAS, TEXAS 75265 


TCM29C18, TCM29C1S 
ANALOG INTERFACE FOR DSP 


03036, AUGUST 1987 


@ Reliable Silicon-Gate CMOS Technology N DUAL-IN-LINE PACKAGE 
(TOP VIEW) 


@ Low Power Consumption 
Operating Mode . . . 80 mW Vea! Ute vec 
Power-Down Mode... 5 mW PWRO + []2 15[ 1|GSX 

p-Law Coding PWRO-L]3 = 14] ANLG IN 


© Excellent Power Supply Rejection Ratio over PONLJ4 = 13[ JANLG GND 


Frequency Range of 0 to 50 kHz DCLKR[JS = 12] TSX/DCLKX 
PCM INLJ6 = 11. JPCM OUT 
® No External Components Needed for FSR/TSREL]?  10[]FSX/TSXE 


Sample, Hold, and Auto-Zero Functions DGTL GND [}8 91 CLK 


Precision Internal Voltage References 


@ Single Chip Contains A/D, D/A, and 
Associated Filters 


FEATURE TABLE 


16 Pins 
p-Law Coding 
Variable Mode: 
64 kHz to 2.048 MHz 


Fixed Mode: 
2.048 MHz (TCM29C18), 
1.536 MHz (TCM29C19) 

8-Bit Resolution 
12-Bit Dynamic Range 


description 
The TCM29C18 and the TCM29C19 are low-cost single-chip pulse-code-modulated encoders and decoders 
(PCM codecs) and PCM line filters. These devices incorporate both the A/D and D/A functions, an anti- 
aliasing filter (A/D), and a smoothing filter (D/A). The TCM29C18 and the TCM29C19 are ideal for use 
with the TMS320 family members, particularly those featuring a serial port such as the TMS32020, 
TMS32011, and TMS320C25. 


Primary applications of these devices include: 


Digital Encryption Systems 
Digital Voice-Band Data Storage Systems 
Digital Signal Processing 


These devices are designed to perform encoding of analog input signals (A/D conversion) and decoding 
of digital PCM signals (D/A conversion). They are useful for implementation in the analog interface of a 
digital-signal processing system. Both devices also provide band-pass filtering of the analog signals prior 
to encoding and smoothing after decoding. 


The analog input is encoded into an 8-bit digital representation by use of the p-law encoding scheme 
(CCITT G.711) which equates to 12 bits of resolution for low amplitude signals. Similarly, the decoding 
section converts 8-bit PCM data into an analog signal with 12 bits of dynamic range. The filter characteristics 
(bandpass) for the encoder and decoder are determined by a single clock input (CLK). The filter roll-off 
(—3 dB) is derived by: 


“feo = k « fcLK/256 for the TCM29C18 or fog = k « fCLK/192 for the TCM29C19 


where k has a value of 0.44 for the high-frequency roll-off point, and a value of 0.019 for the low-frequency 
roll-off point. 


ADVANCE INFORMATION documents contain . Copyright © 1987, Texas instruments Incorporated 
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TCM29C18, TCM29C19 
ANALOG INTERFACE FOR DSP 


description (continued) 


The sampling rate of the ADC is determined by the Frame Sync Clock, FSX; the sampling rate of the DAC 
is determined by the Frame Sync Clock, FSR. Once a conversion is initiated by FSX or FSR, data is clocked 
in or out on the next consecutive eight clock pulses in the fixed data rate mode: Likewise, data may also 
be transferred on the next eight consecutive clock pulses of the data clocks, DCLKX and DCLKR, in the 
variable data rate mode. In the variable data rate mode, DCLKX and DCLKR are independent, but must 
be in the range from fCLK/32 to fcLk. 


The TCM29C18 and TCM29C19 are characterized for operation over the temperature range of O°C to 70°C. 


functional block diagram 


TRANSMIT 
SECTION 


ANLG IN 


RECEIVE 
SECTION 


PWRO+ 


PWRO- 


SAMPLE 
AND HOLD 
AND DAC 


Vcc Veg OGTL 


SUCCESSIVE 
APPROXI- 
MATION 


OUTPUT 


COMPARA. 
TOR REGISTER 


ANALOG 
TO DIGITAL 


cowreon ft 


LOGIC 


SAMPLE 
AND HOLD 
AND DAC 


ANLG FSR/TSRE 


GND 


PCM OUT 
TSX/DCLKX 


PCM IN 


DCLKR 
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TLC32040M, TLC320401 
ANALOG INTERFACE CIRCUIT 


02964, FEBRUARY 1987 


@ ADVANCED LinCMOS™ Silicon Gate Process J OR N PACKAGE 
Technology (TOP VIEW) 


14-Bit Dynamic Range ADC and DAC 


10-Bit ADC and DAC Linearity Over Any 
10-Bit Range 


@ Variable ADC and DAC Sampling Rate Up to 
19,200 Samples per Second 


® Switched-Capacitor Antialiasing Input Filter 
and Output-Reconstruction Filter 


@ Serial Port for Direct Interface to 
™iS32011, TMS32020, and TMS32025 
Digital Processors 


@® Synchronous or Asynchronous ADC and 
DAC Conversion Rates with Programmable 
Incremental ADC and DAC Conversion NU Nonusable; no external connection 
Timing Adjustments should be made to these pins 


@ Serial Port Interface to SN54299 or 
SN74299 Serial-to-Parallel Shift Registers 
for Paraile! Interface to TMS32010 or Other 
Digital Processors 


description 


The TLC32040 is a complete analog-to-digital and digital-to-analog input/output system on a single 
monolithic CMOS chip. This device integrates a bandpass switched-capacitor antialiasing input filter, a 
14-bit resolution A/D converter, four microprocessor-compatible serial port modes, a 14-bit resolution D/A 
converter, and a low-pass switched-capacitor output-reconstruction filter. The device offers numerous 
combinations of Master Clock input frequencies and conversion/sampling rates, which can be changed 
via digital processor control. 


Typical applications for this IC include modems (7.2-, 8-, 9.6-, 14.4-, and 19.2-kHz sampling rate), analog 
interface for digital signal processors, speech recognition/storage systems, industrial process control, 
biomedical instrumentation, acoustical signal processing, spectral analysis, data acquisition, and 
instrumentation recorders. Four serial modes, which allow direct interface to the TMS32011, TMS32020, 
and TMS32025 digital signal processors, are provided. Also, when the transmit and receive sections of 
the Analog Interface Circuit (AIC) are operating synchronously, it will interface to two SN54299 or SN74299 
serial-to-parallel shift registers. These serial-to-parallel shift registers can then interface in parallel to the 
TMS32010, other digital signal processors, or external FIFO circuitry. Output data pulses are emitted to 
inform the processor that data transmission is complete, or to allow the DSP to differentiate between two 
transmitted bytes. A flexible control scheme is provided so that the functions of the IC can be selected 
and adjusted coincidentally with signal processing via software control. 


The antialiasing input filter comprises seventh-order and fourth-order CC-type (Chebyshev/elliptic 
transitional) low-pass and high-pass filters, respectively, and a fourth-order equalizer. The input filter is 
implemented in switched-capacitor technology and is preceded by a continuous time filter to eliminate 
any possibility of aliasing caused by sampled data filtering. When no filtering is desired, the entire composite 
filter can be switched out of the signal path. A selectable, auxiliary, differential analog input is provided 
for applications where more than one analog input is required. 


ADVANCED LinCMOS” is a trademark of Texas Instruments Incorporated 


PRODUCT PREVIEW documents contain information 3 Copyright © 1987, Texas Instruments Incorporated 
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TLC32040M, TLC320401 
ANALOG INTERFACE CIRCUIT 


description (continued) 


The A/D and D/A converters each have 14 bits of resolution with 10 bits of integral linearity guaranteed 
over any 10-bit range. The A/D and D/A architectures guarantee no missing codes and monotonic operation. 
An internal voltage reference is provided to ease the design task and to provide complete control over 
the performance of the IC. The internal voltage is brought out to a pin and is available to the designer. 
Separate analog and digital voltage supplies and grounds are provided to minimize noise and ensure a wide 
dynamic range. Also, the analog circuit path contains only differential circuitry to keep noise to an absolute 
minimum. The only exception is the DAC sample-and-hold, which utilizes pseudo-differential circuitry. 


The output-reconstruction filter is a seventh-order CC-type (Chebyshev/elliptic transitional low-pass filter 
with a fourth-order equalizer) and is implemented in switched-capacitor technology. This filter is followed 
by a continuous-time filter to eliminate images of the digitally encoded signal. 


The TLC32040M is characterized for operation over the full military temperature range of —55°C to 125°C, 
and the TLC32040I is characterized for operation from —40°C to 85°C. 


functional block diagram 


IN+ SERIAL FSR 
PORT 
a DR 
EOOR 
AUX IN + 
MSTR CLK 
AUX IN- 
RRcRWAE SHIFT CLK 
_——eoO Oo VOLTAGE 
REFERENCE WORD/BYTE 
Dx 
FSX 
OUT+ — 
EODX 
OUT - 


TRANSMIT SECTION 


Vec+ Vec- ANLG DTGL Vpp REF RESET 
GND GND (DIG) 
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TLC32040M, TLC320401 
ANALOG INTERFACE CIRCUIT 


PRINCIPLES OF OPERATION 


analog input 


A/D 


A/D 


Two sets of analog inputs, IN¢+, IN—, and AUX IN+, AUX IN -, are provided. Each input set can be operated 
in either differential or single-ended modes, since sufficient common-mode range and rejection are provided. 
Normally, the IN+ and iN — inputs are used; however, the auxiliary inputs, AUX IN+ and AUX IN —, can 
be used if a second input is required. The gain for the IN+, IN—, and auxiliary AUX IN+ and AUX IN — 
inputs can be programmed to either 1, 2, or 4 (see the Gain Control Table). Either input circuit can be 
selected via software control. It is important to note that a wide dynamic range is assured by the differential 
internal analog architecture and by the separate analog and digital voltage supplies and grounds. 


bandpass filter, A/D bandpass filter clocking, and A/D conversion rate timing 


The A/D bandpass filter can be selected or bypassed via software control. The frequency response of this 
filter is presented in the following pages. This response results when the switched-capacitor filter clock 
frequency is 288 kHz. Several possible options can be used to attain a 288-kHz switched-capacitor filter 
clock. When the filter clock frequency is not 288 kHz, the filter transfer function is frequency-scaled by 
the ratio of the actual clock frequency to 288 kHz. The low-frequency roll-off of the high-pass section 
is 300 kHz. However, the high-pass section low-frequency roll-off can be changed to 200 kHz with a metal 
mask option. 


The Internal Timing Configuration and AIC DX Data Word Format sections of this data sheet indicate the 
many options for attaining a 288-kHz bandpass switched-capacitor filter clock. These sections indicate 
that the RX Counter A can be programmed to give a 288-kHz bandpass-switched capacitor filter clock 
for several Master Clock input frequencies. 


The A/D conversion rate is then attained by frequency-dividing the 288-kHz bandpass switched-capacitor 
filter clock with the RX Counter B. Thus, unwanted aliasing is prevented because the A/D conversion rate 
is an integral submultiple of the bandpass switched-capacitor filter sampling rate, and the two rates are 
synchronously locked. 


converter performance specifications 


Fundamental performance specifications for the A/D converter circuitry are presented in the A/D converter 
operating characteristics section of this data sheet. The realization of the A/D converter circuitry with 
switched-capacitor techniques provides an inherent sample-and-hold. 


analog output 


D/A 


The analog output circuitry is an analog output power amplifier. Both noninverting and inverting amplifier 
outputs are brought out of the IC. This amplifier can drive transformer hybrids or low-impedance loads 
directly in either a differential or single-ended configuration. 


low-pass filter, D/A low-pass filter clocking, and D/A conversion rate timing 


The frequency response of this filter is presented in the following pages. This response results when the 
low-pass switched-capacitor filter clock frequency is 288 kHz. Like the A/D filter, the transfer function 
of this filter is frequency-scaled when the clock frequency is not 288 kHz. A continuous-time filter is provided 
on the output of the D/A low-pass filter to greatly attenuate any switched-capacitor clock feedthrough. 


The D/A conversion rate is then attained by frequency-dividing the 288-kHz switched-capacitor filter clock 
with TX Counter B. Thus, unwanted aliasing is prevented because the D/A conversion rate is an integral 
submultiple of the switched-capacitor low-pass filter sampling rate, and the two rates are synchronously 
locked. 
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TLC32040M, TLC320401 
ANALOG INTERFACE CIRCUIT 


PRINCIPLES OF OPERATION (continued) 


asynchronous versus synchronous operation 


If the transmit section of the AIC (low-pass filter and DAC) and receive section (bandpass filter and ADC) 
are operated asynchronously, the low-pass and band-pass filter clocks are independently generated from 
the Master Clock signal. Also, the D/A and A/D conversion rates are independently determined. If the 
transmit and receive sections are operated synchronously, the low-pass filter clock drives both low-pass 
and band-pass filters. In synchronous operation, the A/D conversion timing is derived from, and is equal 
to, the D/A conversion rate timing. (See description of the WORD/BYTE pin in the Pin Functional Description 
Section.) . 


D/A converter performance specifications 


Fundamental performance specifications for the D/A converter circuitry are presented in the D/A converter 
operating characteristics section of the data sheet. The D/A converter has a sample-and-hold that is realized 
with a switched-capacitor ladder. 


system frequency response correction 


MalA3dY¥d LONGOYd 


Sin x/x correction circuitry is performed in digital signal processor software. The system frequency response 
can be corrected via DSP software to +0.1 dB accuracy to a band-edge of 3000 Hz for all sampling rates. 
This correction is accomplished with a first-order digital correction filter, which requires only seven TMS320 
instruction cycles. With a 200-ns instruction cycle, seven instructions represent an overhead factor of 
only 1.1% and 1.3% for sampling rates of 8 and 9.6 kHz, respectively (see the sin x/x Correction Section 
for more details). 


serial port 


The serial port has four possible modes that are described in detail in the pin description section. These 
modes are briefly described below. 


1. The transmit and receive sections of the AIC are operated asynchronously, and the AIC serial 
port interfaces directly with the TMS32011. 


2. The transmit and receive sections of the AIC are operated asynchronously, and the AIC serial 
port interfaces directly with the TMS32020 and the TMS32025. 


3. The transmit and receive sections of the AIC are operated synchronously, and the AIC serial port 
interfaces directly with the TMS32011. 


4. The transmit and receive sections of the AIC are operated synchronously, and the AIC serial port 
interfaces directly with the TMS32020, TMS32025, or two SN54299 or SN74299 serial-to- 
parallel shift registers, which can then interface in parallel to the TMS32010, to any other digital 
signal processor, or to external FIFO circuitry. 


testing 


An addendum accompanying this data sheet fully describes the test capabilities of the IC, provided by 
the design. 


internal voltage reference 


The internal reference eliminates the need for an external voltage reference, and thus provides overall circuit 
cost reduction. Additionally, the internal reference makes the performance of the IC less susceptible to 
noise. Thus, the internal reference eases the design task ar“ orovides complete control over the performance 
of the IC. The internal reference is brought out to a pi.. and is available to the designer. To keep the amount 
of noise on the reference signal to a minimum, an external capacitor may be connected between REF and 
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TLCO32040M, TLC320401 
ANALOG INTERFACE CIRCUIT 


PRINCIPLES OF OPERATION (continued) 


reset 


A reset function is provided to initiate serial communications between the AIC and DSP and to allow fast, 
cost-effective testing during manufacturing. The reset function will initialize all AIC registers, including 
the control register. The reset pin has an internal pull-up resistor. After a negative-going pulse on the RESET 
pin, the AIC will be initialized. This initialization allows normal serial port communications activity to occur 
between AIC and DSP (see AIC DX Data Word Format section). 


loopback 


This feature allows the user to test the circuit remotely. In loopback, the OUT + and OUT — pins are internally 
connected to the IN+ and IN-— pins. Thus, the DAC bits (d15 to d2), which are transmitted to the DX 
pin, can be compared with the ADC bits (d15 to d2), which are received from the DR pin. An ideal comparison 
would be that the bits on the DR pin equal the bits on the DX pin. However, in practice there will be some 
difference in these bits due to the ADC and DAC output offsets. 


The loopback feature is implemented with digital signal processor control! by transmitting the appropriate 
serial port bit to the control register (see AIC Data Word Format section). 


re 
FANLG GND 17,18] | Analog ground return for all internal analog circuits. Not internally connected to DGTL GND. > 
AUX IN+ 24 Noninverting auxiliary analog input stage. This input can be switched into the bandpass filter and A/D converter} LLJ 
path via software control. If the appropriate bit in the Control register is a 1, the auxiliary inputs will replace cc 
the IN+ and IN— inputs. If the bit is a 0, the IN+ and IN— inputs will be used (see the AIC DX Data Word A. 
Format section). _ 
FAUX IN- 23s | Inverting auxiliary analog input (see the above AUX IN + pin description). oe) 
JDOGTLGND 9 | | Digital ground for all internal logic circuits. Not internally connected to ANLG GND. = 
This pin is used to transmit the ADC output bits from the AIC to the TMS320 serial port. This transmission 
of bits from the AIC to the TMS320 seria! port is synchronized with the SHIFT CLK signal. QO 
This pin is used to receive the DAC input bits and timing and control information from the TMS320. This serial © 
transmission from the TMS320 seria! port to the AIC is synchronized with the SHIFT CLK signal. co 
(See the WORD/BYTE pin description and the Serial Port Timing Diagram.) During the word-mode Oo. 
timing, this signal is a low-going pulse that occurs immediately after the 16 bits of A/D information have been 
transmitted from the AIC to the TMS320 serial port. This signal can be used to interrupt a microprocessor 
upon completion of serial communications. Also, this signal can be used to strobe and enable external serial- 
to-paralle! shift registers, latches, or external FIFO RAM, and to facilitate paralle! data bus communications 
between the AIC and the serial-to-parallel shift registers. During the byte-mode timing, this signal goes low 
after the first byte has been transmitted from the AIC to the TMS320 serial port and is kept low until the 
second byte has been transmitted. The TMS32011 can use this low-going signal to differentiate between 
the two bytes as to which is first and which is second. 
TEXAS 4 
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TLE32040M, TLC320401 
ANALOG INTERFACE CIRCUIT 


DESCRIPTION 
0 


EODX 11 (See the WORD/BYTE pin description and the Serial Port Timing Diagram.) During the word-mode 
timing, this signal is a low-going pulse that occurs immediately after the 16 bits of D/A converter and control 
or register information have been transmitted from the TMS320 serial port to the AIC. This signal can be used 
to interrupt a microprocessor upon the completion of serial! communications. Also, this signal can be used 
to strobe and enable external serial-to-paralle! shift registers, latches, or an external FIFO RAM, and to facilitate 
parallel, data-bus communications between the AIC and the serial-to-paraillel shift registers. During the byte- 
mode timing, this signal goes low after the first byte has been transmitted from the TMS320 seria! port to 
the AIC and is kept low until the second byte has been transmitted. The TMS32011 can use this low-going 
signal to differentiate between the two bytes as to which is first and which is second. 


In the serial transmission modes, which are described in the WORD/BYTE pin description, the FSR pin is held 
low during bit transmission. When the FSR pin goes low, the TMS320 serial port will begin receiving bits from 
the AIC via the OR pin of the AIC. The most significant DR bit will be present on the DR pin before FSR goes 
low. (See Serial Port Timing and Internal Timing Configuration Diagrams.) 


When this pin goes low, the TMS320 serial port will begin transmitting bits to the AIC via the 
DX pin AIC. In all serial transmission modes, which are described in the WORD/BYTE pin description, the FSX 
pin is held low during bit transmission (see Serial Port Timing and Internal Timing Configuration Diagrams). 


The Master Ciock signal is used to derive all the key logic signals of the AIC, such as the Shift Clock, the 
switched-capacitor filter clocks, and the A/D and D/A timing signals. The Internal Timing Configuration diagram 
shows how these key signals are derived. The frequencies of these key signals are synchronous submultiples 
of the Master Clock frequency to eliminate unwanted aliasing when the sampled analog signals are transferred 
between the switched-capacitor filters and the A/D and D/A converters (see the Internal Timing Configuration). 
Noninverting output of analog output power amplifier. Can drive transformer hybrids or high-impedance loads 
directly in either a differential or a single-ended configuration. 


IN + 26 


Inverting output of analog output power amplifier; functionally identical with and complementary to OUT + 

A reset function is provided to initialize the TA, TA’, TB, RA, RA’, RB, and control registers. This 
reset function initiates serial communications between the AIC and DSP. The reset function will initialize all 
AIC registers including the control register. After a negative-going pulse on the RESET 
pin, the AIC registers will be initialized to provide an 8-kHz data conversion rate for a 5.184-MHz master clock 
input signal. The conversion rate adjust registers, TA’ and RA’, will be reset to 1. The CONTROL register bits 
will be reset as follows (see AIC DX Data Word Format section). 


d7 = 1,d6 = 1,d5 = 1,d4 = 0,d3 =0,d2 = 1 


MalAsSdd LONGOYd 


This initialization allows normal serial-port communication to occur between AIC and DSP. This pin has an 
internal puil-up resistor and is set to a high logic level unless it is pulled to ground. 
The Shift Clock signal is obtained by dividing the Master Clock signal frequency by four. This signal is used 
to clock the serial data transfers of the AIC, described in the WORD/BYTE pin description 
below (see the Serial Port Timing and Internal Timing Configuration diagram). 


Digital supply voitage, 5 V +5% 
Positive analog supply voltage, 5 V +5% 


Negative analog supply voltage —5 V +5% 


SHIFT CLK = =10 
Vop 7 
Vec+ 20 
Vcc - 19 


: 
m 
_ 

R 
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TLE32040M, TLC320401 
ANALOG INTERFACE CIRCUIT 


H Serial port wilt directly interface with the serial port of the TMS32020 and communicates in one 
16-bit word. The operation sequence is as follows (see Serial Port Timing diagrams): 
1. The FSX or FSR pin is brought low. 
2. One 16-bit word is transmitted or one 16-bit word is received. 
3. The FSX or FSR pin is brought high. 
4. The EODX or EODR pin emits a low-going pulse. 


DESCRIPTION 
vo 
AlC transmit and receive sections are operated synchronously. 


WORD/BYTE 13 This pin, in conjunction with a bit in the CONTROL register, is used to establish one of four serial 
modes. These four serial modes are described below. This pin has an internal pull-up resistor and is set to 
a logic high unless it is pulled to ground., 
AlC transmit and receive sections are operated asynchronously. 
The following description applies when the AIC is configured to have asynchronous transmit and receive sections. 
If the appropriate data bit in the Contro! register is a O (see the AIC DX Data Word Format), the transmit and 
receive sections will be asynchronous. 
L Serial port will directly interface with the serial port of the TMS32011 and communicates in two 
8-bit bytes. The operation sequence is as follows (see Serial Port Timing diagrams). 
1. The FSX or FSR pin is brought tow. 
; 2. One 8-bit byte is transmitted or one 8-bit byte is received. 
3 
4 
If the appropriate data bit in the Control register is a 1, the transmit and receive sections will be configured 


. The EODX or EODR pin is brought low. 
. The FSX or FSR pin emits a positive frame-sync pulse that is 
four Shift Clock cycles wide. 
5. One 8-bit byte is transmitted or one 8-bit byte is received. 

6. The EODX or EODR pin is brought high. 

7. The FSX or FSR pin is brought high. 
to be synchronous. In this case, the bandpass switched-capacitor filter and the A/D conversion timing will 
be derived from the TX Counter A, TX Counter B, and TA, TA’, and TB registers, rather than the RX Counter 
A, RX Counter B, and RA, RA’, and RB registers. {n this case, the AIC FSX and FSR timing will be identical, 
as will the EODX and EODA timing. The synchronous operation sequences are as follows (see Serial Port Timing 
diagrams). 

L Serial port will directly interface with the serial port of the TMS32011 and communicates in two 
8-bit bytes. The operation sequence is as follows (see Serial Port Timing diagrams): 

1. The FSX and FSR pins are brought low. 

2. One 8-bit byte is transmitted and one 8-bit byte is received. 

3. The EODX and EODR pins are brought low. 

4. The FSX and F5R pins emit positive frame-sync pulses that are 

four Shift Clock cycles wide. 

5. One 8-bit byte is transmitted and one 8-bit byte is received. 

6. The EODX and EODR pins are brought high. 

7. The FSX and FSR pins are brought high. 

H Serial port will directly interface with the serial port of the TMS32020 and communicates in one 
16-bit word. The operation sequence is as follows {see Serial Port Timing diagrams): 

1. The FSX and FSR pins are brought low. 

2. One 16-bit word is transmitted and one 16-bit word is received. 

3. The FSX and FSR pins are brought high. 

4. The EODX or EODR pins emit tow-going pulses. 
Since the transmit and receive sections of the AIC are now synchronous, the AIC serial port, with additional 
NOR and AND gates, wil! interface to two SN54299 or SN742939 serial-to-paraliel shift registers. Interfacing 
the AIC to the SN54299 or SN74299 shift register allows the AIC to interface to an external FIFO RAM and 
facilitates parallel, data bus communications between the AIC and the digital signal processor. The operation 
sequence is the same as the above sequence (see Serial Port Timing diagrams). 
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TLE32040M, TLC320401 
ANALOG INTERFACE CIRCUIT 


INTERNAL TIMING CONFIGURATION 


MASTER CLOCK = — — —————-—--—-—4 eno 
5.184 MHz (1)¢ DIVIDE BY 4 1.296 MHz (1) 
ee re eee ae oe ae oe a ee el 2.692 MHz (2) 
20.736 MHz (1) Se Se ee ee ee a a ee ee ee op 
41.472 MHz (2) : 
XTAL TMS320 TA REGISTER Pare | 
osc DSP ‘eeiysi (6 BITS) 
| (2's COMPL) 
| LOWPASS | 
SWITCHED | 
OPTIONAL EXTERNAL CIRCUITRY | ADDER/ DIVIDER CIRCUITRY CAP FILTER 
FOR FULL DUPLEX MODEMS SUBTRACTOR CLK = 288 kHz | 
H 153.6 kHz 71 (6 BITS) SQUARE WAVE 1 
LY Powive [SLO (YF commerciac II TB REGISTER | 
} [BY 135 EXTERNAL | | (6 BITS) 
FRONT-END | f | 
1! FULL-DUPLEX | Q O 
| SPLIT-BAND 7 do.d,=0,1 : | 
t do.d4=1,0 
L Maat a a TX COUNTER B | 
a aa ar i TOES Fy —— Soe Liege sia 
TB=36; 8.0 kHz CONVERSION | 
"U | (TA= 9(1)1 TB=30; 9.6 kH 
(TA = 18 (2}] B=30; 9.6 kHz FREQUENCY | 
8) | 576-kH2z TB=20; 14.4 kHz 
oO eels! PULSES TB= 15; 19.2 kHz | 
Y RA’ REGISTER | 
J |] Ra REGISTER @cire| 
‘@) PEltel (2's COMPL) 
—| | BANDPASS | 
SWITCHED l 
u | ADDER/ DIVIDER CIRCUITRY CAP FILTER 
| SUBTRACTOR CLK = 288 kHz | 
J (6 BITS) SQUARE WAVE 
rr 
< | RB REGISTER | 
| (6 BITS) * | 
bed 
mi | J } | 
= do.dq=0,0 do.dy = 0.1 
=1,1% ° = 
do.dy=1,1 ‘ do.d_4=1,0 RX COUNTER B 
B=40; 7.2 kH 
| RX COUNTER A Ana ate Sis AID 
| (RA = 9 (1)] Sine CONVERSION | 
RA = 18 (2)] Ree eer ere FREQUENCY 
| ie Bins 576-kHz RB= 20; 14.4 kHz | 
: : PULSES RB= 15; 19.2 kHz 


Oe ee acts tak, ee 


NOTE: Frequency 1, 20.736 MHz, is used to show how 153.6 kHz (for a commercially available modem split-band filter clock), popular 
speech and modem sampling signal frequencies, and an internal 288-kHz switched-capacitor filter clock can be derived synchronously 
and as submultiples of the crystal oscillator frequency. Since these derived frequencies are synchronous submultiples of the crystal 
frequency, aliasing does not occur as the sampled analog signal passes between the analog converter and switched-capacitor filter 
stages. Frequency 2, 41.472 MHz, is used to show that the AIC can work with high-frequency signals, which are used by high- 
speed digital signal processors. . 

TSplit-band filtering can alternatively be performed after the analog input function via software in the TMS320. 

+These control bits are described in the AIC DX Data Word Format section. 
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. TLC32040M, TLC32040) 
ANALOG INTERFACE CIRCUIT 


explanation of internal timing configuration 


All of the internal timing of the AIC is derived from the high-frequency clock signal that drives the Master 
Clock input pin. The Shift Clock signal, which strobes the serial port data between the AIC and DSP, is 
derived by dividing the Master Clock input signal frequency by four. 


TX Counter A and TX Counter B, which are driven by the Master Clock signal, determine the D/A conversion 
period timing. Similarly, RX Counter A and RX Counter B determine the A/D conversion period timing. In 
order for the switched-capacitor low-pass and band-pass filters to meet their transfer function specifications, 
the frequency of the clock inputs of the switched-capacitor filter must be 288 kHz. If the frequencies of 
the clock inputs are not 288 kHz, the filter transfer function frequencies are scaled by the ratios of the 
clock frequencies to 288 kHz. Thus, to obtain the specified filter responses, the combination of Master 
Clock frequency and TX Counter A and RX Counter A values must yield 288-kHz switched-capacitor clock 
signals. These 288-kHz clock signals can then be divided by the TX Counter B and RX Counter B to establish 
the D/A and A/D conversion period timings. 


TX Counter A and TX Counter B are reloaded every D/A conversion period, while RX Counter A and RX 
Counter B are reloaded every A/D conversion period. The TX Counter B and RX Counter B are loaded with 
the values in the TB and RB Registers respectively. Via software control, the TX Counter A can be loaded 
with either the TA Register, the TA Register less the TA’ Register, or the TA Register plus the TA’ Register. 
By selecting the TA Register less the TA’ Register option, the upcoming conversion period timing wil! occur 
earlier by an amount of time that equals TA’ times the signal period of the Master Clock. By selecting 
the TA Register plus the TA’ Register option, the upcoming conversion period timing will occur later by 
an amount of time that equals TA’ times the signal period of the Master Clock. Thus, the D/A conversion 
timing can be advanced or retarded. An identical ability to alter the A/D conversion timing is provided. 
In this case, however, the RX Counter A can be programmed via software contro! with the RA Register, 
the RA Register less the RA’ Register, or the RA Register plus the RA’ Register. 


The above feature is particularly useful for modem applications. This feature allows controlled changes 
in the A/D and D/A conversion timing. This feature can be used to enhance signal-to-noise performance, 
to perform frequency-tracking functions, and to generate nonstandard modem frequencies. 


If the transmit and receive sections are configured to be synchronous (see WORD/BYTE pin description), 
then both the low-pass and bandpass switched-capacitor filter clocks are derived from TX Counter A. Also, 
both the D/A and A/D conversion timing are derived from the TX Counter A and TX Counter B. When the 
transmit and receive sections are configured to be synchronous, the RX Counter A, RX Counter B, RA 
Register, RA’ Register, and RB Registers are not used. 
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TLCO820A, TLCO820B 


ADVANCED LinCMOS™ HIGH-SPEED 8-BIT ANALOG-TO-DIGITAL 
CONVERTERS USING MODIFIED “FLASH” TECHNIQUES 


Advanced LinCMOS” Silicon-Gate 
Technology 


8-Bit Resolution 

Differential Reference Inputs 

Parallel Microprocessor Interface 
Conversion Access Time Over Temperature 
Range 


Write-Read Mode .. . 1.18 zs and 1.92 ps 
Read Mode... 2.6 ps Max 


No External Clock or Oscillator Components 
Required 


On-Chip Track-and-Hold 


Low Power Consumption . .. 50 mW Typ 


Single 5-V Supply 


TLCO820B is Direct Replacement for 
National Semiconductor ADCOQ820B/BC and 
Analog Devices AD7820L/C/U; 

TLCO820A is Direct Replacement for 
National Semiconductor ADCO820C/CC and 
Analog Devices AD7820K/B/T 


description 


The TLCO820A and TLCO820B are Advanced 
LinCMOS™ 8-bit analog-to-digital converters 
each consisting of two 4-bit ‘‘flash’’ converters, 
a 4-bit digital-to-analog converter, a summing 
(error) amplifier, control logic, and a result latch 
circuit. The modified ‘‘flash’’ technique allows 
low-power integrated circuitry to complete an 
8-bit conversion in 1.18 microseconds over 
temperature. The on-chip track-and-hold circuit 
has a 100-nanosecond sample window and 
allows the TLCO820A and TLCO820B to convert 
continuous analog signals having slew rates of 
up to 100 millivolts per microsecond without 
external sampling components. TTL-compatible 
three-state output drivers and two modes of 
operation allow interfacing to a variety of 
microprocessors. Detailed information on 
interfacing to most popular microprocessors is 
readily available from the factory. 


02873, SEPTEMBER 1986—REVISED DECEMBER 1987 


TLCO820AM, TLCO820BM .. . DW, J OR N PACKAGE 
TLCO820AlI, TLCO820B! .. . DW OR N PACKAGE 
TLCO820AC, TLCO820BC ... DW OR N PACKAGE 


(TOP VIEW) 


TLCO820AM, TLCO820BM . . . FK PACKAGE 
TLCO820AlI, TLCO820B1. . . FN PACKAGE 
TLCO820AC, TLCO820BC . .. FN PACKAGE 


(TOP VIEW) 


O 
OU 
>Zz 
om 2 on 


(7 DO (LSB) 
-U ANLG IN 


20 19 


p24 18[] OFLW 
D3 {5 17] 07 (MSB) 
WRIRDY []6 16(] 06 
MODE fJ7 150] D5 
RD js 140] D4 


10 11 
entan 
rFay;++|” 
22Z2uLn/0 
Ow w 
co 


NC—No internal connection 


The TLCO820AM and TLCO820BM are available in the DW or N plastic and the J ceramic packages and 
are characterized for operation over the full military temperature range of — 55°C to 125°C. The TLCO820Al 
and TLCO820BI are characterized for operation from — 40°C to 85°C. The TLCO820AC and TLCO820BC 


are characterized for operation from O°C to 70°C. 


Advanced LinCMOS is a trademark of Texas Instruments. 
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ADVANCE INFORMATION documents contain 
information on new products in the sampling or 
preproduction phase of development. Characteristic 
data and other specifications are subject to change 
without notice. 
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TLCO820A, TLCO820B 
ADVANCED LinCMOS™ HIGH-SPEED 8-BIT ANALOG-TO-DIGITAL 
CONVERTERS USING MODIFIED “FLASH” TECHNIQUES 


functional block diagram 


(12) 


REF + 


(11) 


REF - 


(1) 
ANLG IN 


MODE 7) 
(6) 


WR/RDY 


RD 


4-BIT FLASH 
ANALOG-TO- 
DIGITAL 


fades CONVERTER 


(4 MSBs) 


a DIGITAL- ahG 
an Te eNAEOS 3-STATE 
CONVERTER 


BUFFERS 


4-BIT FLASH 
ANALOG-TO- 
DIGITAL 
CONVERTER 
(4 LSBs) 


TIMING 
AND 
CONTROL 


i 
TEXAS 
INSTRUMENTS 


POST OFFICE BOX 655012 * DALLAS, TEXAS 75265 


(18) 
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(14) 

(15) 

(16) 

(17) 
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D1 
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PRODUCT TLC1205A, TLC1205B, TLC1225A, TLC1225B 
PREVIEW SELF-CALIBRATING 12-BIT-PLUS-SIGN UNIPOLAR OR BIPOLAR 
ANALOG-TO-DIGITAL CONVERTERS 


02982, FEBRUARY 1987 


@ ADVANCED LinCMOS™ Technology TLCO1205 
J OR N DUAL-IN-LINE PACKAGE 
@ Self-Calibration Eliminates Expensive (TOP VIEW) 
Trimming at Factory and Offset Adjustment 
in the Field ANLG Vcc - [1 U 24D] DGTL Vcc 


IN- [2 23 } 012/D7/0 (status) 
IN+ []3 22| |] 012/D6/SARS 
+% and +1 LSB Linearity Error in Unipolar ANLG GND [ ]4 |] 012/D5/0/D15 
Configuration REF { }5 20{ ] D12/D4/0/D14 vO 
ANLG Vcc + Lj6 19] ] 011/D3/0/D13 BUS 
vos [|7 181] 010/D2/BYST/DI2 

CLK IN [8 17[] D9/D1/EOC/DI1 

WR 16{] D8/DO/INT/DIO 


12-Bit Plus Sign Unipolar or Bit Bipolar 


@ 10 ps Conversion Time (Mode 2) 
(clock = 2.6 MHz) 

20 ps Conversion Time (Mode 1) 
(clock = 2.6 MHz) 


Compatible with All Microprocessors RO []11 14] READY OUT 
. " DGTL GND [f12.—s 13[ ] STATUS 
True Differential Analog Voltage Inputs 

0 to 5 V Analog Voltage Range with Single TLOI225 


. : J OR N DUAL-IN-LINE PACKAGE 
5-V Supply (Unipolar Configuration) (TOP VIEW) 


© -5V to 5 V Analog Voltage Range with 


+5-V Supplies (Bipolar Configuration) ae oe or YCC 
Low Power... 25 mW Maximum iIN+ L]3 D11 
Replaces National Semiconductor ADC1205 ne yi 4: oe 
and ADC1225 in Mode 1 Operation ANLG Vcc: Ce bs 
description aia 4 at] zs ue 
The TLC1205 and TLC1225 converters are WR{]9 = 20.) D5 /p15 


manufactured with Texas Instruments highly CS[}to 19, pa/pi4 


efficient ADVANCED LinCMOS™ technology. RD []11 = 18[.] 03/D13 
Either of the TLC1205 or TLC1225 CMOS DGTL GND [12.17] D2/b12 
analog-to digital converters can be operated as READY OUT [}13 167 ] D1/D11 
a unipolar or bipolar converter. A unipolar input INT []14 1577] DO/DIO 


(0 to 5 V) can be accommodated with a single 

5-volt supply, while a bipolar input (—5 V to 5 V) requires the addition of a 5-volt negative supply. 
Conversion is performed via the successive-approximation method. The 24-pin TLC1205 outputs the 
converted data in two 8-bit bytes, while the TLC1225 outputs the converted data in a parallel word and 
interfaces directly to a 16-bit data bus. Negative numbers are given in the 2’s complement data format. 
All digital signals are fully TTL and CMOS compatible. 


These converters utilize a self-calibration technique by which seven of the internal capacitors in the 
capacitive ladder of the A/D conversion circuitry can be automatically or manually calibrated. If the 
converters are operated in Mode 1, one of the seven internal capacitors is calibrated during the first part 
of the conversion sequence. For example, one capacitor is calibrated during the first conversion. The next 
capacitor is calibrated during the second conversion. If the converters are operated in Mode 2, the internal 
capacitors are calibrated during a nonconversion, capacitor-calibrate cycle in which all seven of the internal 
capacitors are calibrated at the same time. A Mode 2 conversion requires only 10 ys (2.6 MHz clock) after 
the nonconversion, capacitor-calibrating cycle has been completed. The calibration or conversion cycle 
may be initiated at any time by issuing the proper address to the data bus. The self-calibrating techniques 
eliminate the need for expensive trimming of thin-film resistors at the factory and provide excellent 
performance at low cost. 


ADVANCED LinCMOS” is a trademark of Texas Instruments Incorporated 
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TL€1205A, TLO1205B, TLC1225A, TLC1225B PRODUCT 
SELF-CALIBRATING 12-BIT-PLUS-SIGN UNIPOLAR OR BIPOLAR PREVIEW 


ANALOG-TO-DIGITAL CONVERTERS 


functional block diagram 


i ST eet Tae etl ee ee Re Baa f 
MICROPROCESSOR | 
| 
8-BIT 8 | 8 
8-BIT SWITCH | 
ANLG Vcc — 5 eee | CONTROL | 
13-BIT | 
eeeg |__| CAPACITOR DAC | 8-BIT SAR 
AND S/H 
a : REGISTER 1 
REGISTER 2 8-BIT 
DATA 
IN- ALU PATH | 
REF 
8-BIT 8-WORD | 
CALIBRATION RAM | 
q g ADDRESS | ADDRESS | 
COUNTER | COUNTER 
5 V- 10 V TRANSLATOR 1 2 
13-BIT SWITCH CONTROL CLOCKS | 
13 BIT SAR 6 | 
INPUT DATA LATCHES | 
13-BIT CALIBRATION | 
CONTROL LOGIC : | 
! 
13-BIT DATA LATCH ! | 
| 
MUX CONTROL | 
7 ROM 
VO BUS | 
TLC1205: + = 8 | | 
TLO1225: # = 13 | 
vos ; PROGRAM I 
iNT ! COUNTER | 
cs u ! 
os Se aap |e Se Se a 
we 
RD 
READY OUT 
STATUS (TLC1205 ONLY) 


In Mode 1, these converters are replacements for National Semiconductor ADC1205 and ADC1225 
integrated circuits. The Mode 1 conversion time for guaranteed accuracy is 51 clock cycles. In the Mode 2 
operation, these devices are no longer true replacements. However, the Mode 2 conversion time for 
guaranteed accuracy is only 26 clock cycles. 


The TLC1205AM, TLC1205BM, TLC1225AM, and TLC1225BM are characterized for operation over the 
full military temperature range of ~55°C to 125°C. The TLC1205Al, TLC1205Bl, TLC1225Al, and 
TLC1225Bl are characterized for operation from --40°C to 85°C. 
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ADVANCE TLO7524 
INFORMATION Advanced LinCMOS™ 8-BiT MULTIPLYING 
DIGITAL-TO-ANALOG CONVERTER 
D3008, SEPTEMBER 1986 
@ Advanced LinCMOS™ Silicon-Gate D OR N PACKAGE 
Technology (TOP VIEW) 
@ Easily Interfaced to Microprocessors 
@ On-Chip Data Latches 
@ Guaranteed Monotonicity 
@ Segmented High-Order Bits Ensure Low- 
Glitch Output 
@ Designed to be interchangeable with Analog 
Devices AD7524, PMI PM-7524, and Micro 
Power Systems MP7524 
@ Fast Control Signaling for Digital Signal 
Processor Applications Including Interface 
with TMS320 
Resolution 8 Bits 
Linearity error Ye LSB Max 
Power dissipation 
5 mW Max 
at Vop = 5V 
Settling time 100 ns Max 
Propagation delay 80 ns Max 
description 


The TLC7524 is an Advanced LinCMOS™ 8-bit digital-to-analog converter (DAC) designed for easy interface 
to most popular microprocessors. 


The TLC7524 is an 8-bit multiplying DAC with input latches and with a load cycle similar to the ‘‘write’’ 
cycle of a random access memory. Segmenting the high-order bits minimizes glitches during changes in 
the most-significant bits, which produce the highest glitch impulse. The TLC7524 provides accuracy to 
¥% LSB without the need for thin-film resistors or laser trimming, while dissipating less than 5 milliwatts 
typically. 

Featuring operation from a 5-V to 15-V single supply, the TLC7524 interfaces easily to most microprocessor 


buses or output ports. Excellent multiplying (2 or 4 quadrant) makes the TLC 7524 an ideal choice for many 
microprocessor-controlled gain-setting and signal-control applications. 


The TLC75241 is characterized for operation from — 25°C to 85°C, and the TLC7524C is characterized 
for operation from O°C to 70°C. 


Advanced LinCMOS is a trademark of Texas Instruments Incorporated. 
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ADVANCE INFORMATION documents contain 
information on new products in the sampling or 
preproduction phase of development. Characteristic 
data and other specifications are subject to change 
without notice. 
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TLE7524 
Advanced LinCMOS™ 8-BIT MULTIPLYING 
DIGITAL-TO-ANALOG CONVERTER 


functional block diagram 


Vpp 


DATA INPUTS 


Operating sequence 
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Appendix F - Tl Sockets 


IC SOCKETS 
PLASTIC LEADED CHIP CARRIER 


PERFORMANCE SPECIFICATIONS 


Mechanical 

Recommended PCB thickness range: 0.062 in to 0.092 in 
Recommended PCB hole size range: 0.032 in to 0.042 in 
Vibration: 15 G 

Shock: 100 G 

Solderability: Per MIL-STD 202, Method 208 

Insertion force: 0.59 Ibs per position 

Withdrawal force: 0.25 Ibs per position 

Normal force: 200 g min, 450 g typ 

Wipe: 0.075 in min 

Durability: 5 cycles min 

Contact retention: 1.5 ibs min 


Electrical 

Current carrying capacity: 1 A 

Insulation resistance: 5000 MQ min 

Dielectric withstanding voltage: 1000 V ac rms min 
Capacitance: 1.0 pF max 


Environmental 

Operating temperature: 

Operating: — 40°C to 85°C 

Storage: ~40°C to 95°C 

Temperature cycling with humidity: will conform to final EIA 
specifications 

Shelf life: 1 year min 

MATERIALS 

Body — Ryton R-4 (40% glass) U/L 94-VO rating 

Contacts — CDA 510 spring temper 

Contact finish — 90/10 tin (200 pin — 400 pin) over 40 pin 
copper 


Contact factory for detailed information 


PLASTIC LEADER CHIP CARRIER CPR SERIES 


Device guide barriers not shown 


8,13 
(0.320) 

90° TYP 0,63 
2,54 (0.100) {0.025) 


TYP 


DEVICE GUIDE 
BARRIERS 


UNIQUE, HIGH 
NORMAL FORCE 
CONTACT 


EASILY nit tt 
AUTO INSERTED 


} 


CLOSED BOTTOM 
DESIGN 


PART NUMBER SYSTEM 


CPR PH XXX —-X-—-X-O 


Contact surface 1 — tin lead 
plating 
Contact spacing 1 — 0.050 in 
Number of pos (044, 052, 068, 084) 
Plated thru hole, solder tail 
TI socket Series 
Plastic leaded chip carrier 


2,54 
(0.100) TYP 


STANDOFF 


2.54 
(0.100) TYP 


a a ee 
21,43 17,78 12,70 
(0.844 | (0.700) | (0.500) 

52 23,98 20,32 15,24 

(0.944) | (0.800) | (0.600) 
29,06 25,40 20,32 
(1.144) } (1.000) | (0.800) 
34,14 30,48 25,40 
(1.344) | (1.200) | (1.000) 


Extraction tool available, consult factory. 
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IC SOCKETS 


PRODUCT FEATURES 

Can be loaded by top actuated insertion or press-in 
insertion, either manually or automatically 

High reliability due to high pressure contact point 

Open body and high stand-off design provide high efficiency 
in heat dissipation 

High durability up to 10,000 cycles 

Compact design 


PERFORMANCE SPECIFICATIONS 


Mechanical 

Durability: 10,000 cycles 

Operating Temperature: 180°C max 
Electrical 

Contact rating: 1.0 A per contact 
Contact resistance: 30 mf. max 


Insulation resistance: 1000 MQ min PART NUMBER SYSTEM 

Dielectric withstanding voltage: 500 V ac rms min CPJ AA33A — XXX B 

MATERIALS 

Body — ultem glass filled (U/L 94 VO) 

Contact — copper alloy Number of positions 


Plating — overall gold plate ; 
Tl series socket 


PLCC BURN-IN/TEST SOCKETS CPJ SERIES 


23,00 (0.906) 2,54 
(0.100) 


17,20 (0.677) 1,27 (0.050) 


{0.100} ® 
Ug -<-| ° 
nyt 1.27 (0.050) : © 
17,90 at HH ee 
(0.705) Cou Od co) ; 


6 
7.80 (0.307) ie, : 

12,10 ae 
(0.476) ja o0i 1.27 (0.050) 
3.81 (0.150) 5,08 (0.200) 

48.08 (0.712) 
SIZES: 18 PIN 
22 PIN 


5,08 (0.200) 
12,90 (0.507) 


14,00 (0.551) 


3.00 (0.118) 


0,50 | 0.35 


. 1,30 (0.051) 
{0.020} (0.014) 


Dimensions in parentheses are inches 
Contact factory for detailed information 
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PERFORMANCE SPECIFICATIONS 


Mechanical 

Accommodates IC leads 0.015 in to 0.021 in diameter 

Recommended PCB thickness range: 0.062 in to 0.092 in 

Recommended PCB hole size range: 0.032 in to 0.042 in 

Recommended hole grid pattern: 0.100 in + 0.002 in each 
direction 

Vibration: 15 G, 10-2000 Hz per MIL-STD 1344A, 
Method 2005.1 Test Condition Ill 

Shock: 100 G, sawtooth waveform, 2 shocks each direction 
per MIL-STD 202, Method 213, Test Condition | 

Durability: 5 cycles, 10 mQ max contact resistance change 
per MIL-STD 1344, Method 2016 

Solderability: per MIL-STD 202, Method 208 

Insertion force: 3.6 oz (102 g) per pin typ using 0.018 in 
diameter test pin 

Withdrawal force: 0.5 oz (14 g) per pin min using 0.018 in 
diameter test pin 

Electrical 

Contact rating: 1.0 A per contact 

Contact resistance: 20 mQ max initial 

Insulation resistance: 1000 MQ at 500 V dc per 
MIL-STD 1344, Method 3003.1 

Dielectric withstanding voltage: 1000 V ac rms 
per MIL-STD 1344, Method 3001.1 

Capacitance: 1.0 pF max per MIL-STD 202, Method 305 

Environmental 

Operating temperature: — 65°C to 125°C, gold; - 40°C to 
100°C, tin 

Corrosive atmosphere: 10 mf max contact resistance 
change when exposed to 22% ammonium sulfide for 
4 hours 

Gas tight: 10 m2 max contact resistance change when 
exposed to nitric acid vapor for 1 hour 

Temperature soak: 10 m{Q max contact resistance change 
when exposed to 105°C temperature for 48 hours 

Shell life: 12 months min 


MATERIALS 

Body — PBT polyester U/L94-VO rating 

On request, G10/FR4 or Mylar film 

Outer sleeve — Machined Brass (QQ-B-626) 

inner contact — Beryllium copper (QQ-C-530) heat treated 
Plating: (specified by part number) 


PIN GRID ARRAY 


1.3/2.0 
(0.05/0.08) TYP 0.10/0.12) (0. 100) TYP NONCUMULATIVE 
3,6/4,6 
(0.14/0. 18) 
| 33 
2. eae 61 (0.021) DIA L. os3)" DIA 
(0.105/0.150) 


IC SOCKETS 
HIGH DENSITY PIN GRID ARRAY 


WIDE-TAPERED 
ENTRY 


PRECISION 
MACHINED 
SLEEVE 


PRECISION 
SIX-FINGERED 
INNER CONTACT 


Inner contact — 30 pin gold over 50 pin nickel or 100 pin 
tin/lead over 50 xin nickel 

Outer sleeve: — 10 pin gold over 50 yin nickel or 50 pin 
tin/lead over 50 yin nickel 


PART NUMBER SYSTEM 


C X G XX — XXX Fam RH 


Pin ce 


awe 0. 10 | | 3-0.510 tong | = i086, 150 


Plating 


Body Style and Orientation 
Contact Loading Pattern 


Pin ; 
Grid Number of Pins 
Array 024 to 324 


Overall Grid Size 
5x5=05 to 18x 18=18 
BODY MATERIAL 
G — Glass Filled Epoxy 
P — PBT Polyester 


TI Socket 


Insulator Size 


(0.950) 24,13 (0.800) 20,32 
(1.050) 26,67 (0.900) 22,86 
(1.150) 29,21 (1.000) 25,40 
(1.250) 31,75 (1.100) 27,94 


(1.350) 34,29 (1.200) 30,48 
(1.450) 36,83 (1.300) 33,02 
(1.550) 39,37 (1.400) 35,56 
(1.650) 41,91 (1.500) 38,10 
(1.750) 44,45 (1.600) 40,64 
(1.850) 46,99 (1.700) 43,18 


TNoncumulative 
Dimensions in parentheses are inches 
Consult factory for detailed information 
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IC SOCKETS 
BURN-IN/TEST PIN GRID ARRAY 


PERFORMANCE SPECIFICATIONS 


Mechanical 
Accommodates IC leads per specific IC device 
Recommended PCB thickness range: 0.062 in to 0.092 in 
Recommended PCB hole size range: 0.032 in to 0.042 in 
Durability: 5000 cycles, 10 mQ max contact resistance 
change per MIL-STD 1344, Method 2016 
Solderability: per MIL-STD 202, Method 208 
Electrical 
Contact rating: 1.0 A per contact 
Contact resistance: 20 mf max initial 
Insulation resistance: 1.0 MQ at 500 V dc per ZERO INSERTION 
MIL-STD 1344, Method 3003.1 CLOSED BOTTOM FORCE DUAL 
Dielectric withstanding voltage: 700 V ac rms per DESIGN BEAM CONTACT 
MIL-STD 1344, Method 3001.1 SYSTEM 
Capacitance: 1.0 pF max per MIL-STD 202, Method 305 
Environmental PART NUMBER SYSTEM 
Operating temperature: —65°C to 170°C C XX X — XXX - XX — 
Humidity: 10 m2 max contact resistance change when | 
PI 
3 


tested per MIL-STD 202, Method 103B 
Temperature soak; 10 mQ max contact resistance change 
when exposed to 105°C temperature for 48 hours 
Shelf life: 12 months max 


01 
lating 
7 — overall gold plate 


Number of positions 
MATERIALS : ; 
Body — CZF Series: PPS (polyphenylen sufide) glass filled Configuration 
U/L 94 VO rating, - 65°C to 170°C W-1txttx2 


Contact — Beryllium copper 
Plating: t Overall gold plate min 4 pin over min 70 pin nickel 
plating TI Series socket 


Style ZF — Zero force 


tFor additional plating option consult the factory. 
BURN-IN TEST PIN GRID ARRAY 


83,82 ‘ 
(3.30) 4 8,9 35,05 
se {0.354) (1.38) MAX 
——2,54 

(0.100) 


6,99 
Y (0.275) 
121 x 2 25,40 
(1.00) 
29,60 
(1.165) 
morte erw eK | 
. - 


Dimensions in parentheses are inches 
Contact factory for detailed information 


48,00 — 59,94 

(1.89) — (2.36) 
14,98 
(0.590) 

33.02 

(1.30) 
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Vendors of crystals suitable for use with TMS320 devices are listed below. 


RXD, Inc. N.E.L. Frequency Controls, Inc. 
Norfolk, NB Burlington, WI 
(800) 228-8108 (414) 763-3591 


CTS Knight, Inc. 
Contact the local distributor. 
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G. ROM Codes 


Board space is often a critical concern in many DSP applications. In order to 
reduce chip count and provide the customer with a single-chip solution, Texas 
Instruments offers microcomputer versions for TMS320C2x (second- 
generation TMS320) devices. The on-chip ROM of these processors can be 
masked with the customer’s own code. This allows the user to take advantage 
of the general-purpose features of Tl’s digital signal processors while at the 
same time customizing the processor to suit a specific application. 


To facilitate design, all prototype work is performed using a standard 
TMS320C2x microprocessor. TMS320C2x development tools permit a de- 
signer to test and refine algorithms for immediate results. When the algorithm 
has been finalized, the customer can submit the code to Texas Instruments to 
be masked into the on-chip ROM of the device. 


The MP/MC (microprocessor/microcomputer) mode, offered on maskable 
TMS320C2x devices such as the TMS320C25, can shorten design and field 
upgrade cycle times, thereby reducing expense. This mode permits the cus- 
tomer to use the TMS320C2x as a standard device operating out of external 
program memory. When TMS320C2x code is altered during design, the de- 
lays associated with new silicon processing are avoided. Field upgrade cycle 
times and the associated expense of inventory obsolescence when the code 
is altered are also avoided. Note that the TMS32020 has no on-chip ROM 
and operates in the microprocessor mode only. 


An entire algorithm or an often-used routine may be masked into the on-chip 
ROM space of a TMS320C2x device. TMS320C2x programs can also be ex- 
panded using external memory. With a reduced chip count and this program 
memory flexibilty, multiple functions can be more easily implemented in a 
single hardware device, thus enhancing a product's capabilities. The 
TMS320C25 with 4K words of on-chip ROM can be ordered as a masked 
device. The customer’s code must fit within the specified ROM size of the 
processor. 


Figure G-1 illustrates the procedure flow for implementing TMS320C2x 
masked parts. With any masked device order, there is a one-time charge of 
$6000 for mask tooling which includes 10 prototypes. A non-cancellable 
minimum production order per year of 5000 units is required for the 
TMS320C25. 
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CUSTOMER TMS320C2x DESIGN — 


CUSTOMER SUBMITS: 
- TMS320C2x NEW CODE RELEASE FORM 

— PRINT EVALUATION AND ACCEPTANCE FORM (PEAF) 

- PURCHASE ORDER FOR MASK CHARGE/10 PROTOTYPES 
— TMS320C2x CODE 


TEXAS INSTRUMENTS RESPONDS: 
~- CUSTOMER CODE INPUT INTO TI SYSTEM 
— CODE SENT BACK TO CUSTOMER FOR VERIFICATION 


CUSTOMER 
APPROVES 
ALGORITHM 


Tl PRODUCES 19 PROTOTYPES 


CUSTOMER 
APPROVES 
PROTOTYPES (MINIMUM 
PRODUCTION ORDER 
REQUIRED) 


TMS320C2x PRODUCTION 


Figure G-1. TMS320C2x ROM Code Flowchart 
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Leadtimes for the first 10 prototype units begin when the customer has 
formally verified that Tl has recorded his code correctly. Leadtimes for the first 
production order begin once the customer formally approves the masked pro- 
totypes. The typical leadtime for masked TMS320C2x prototypes is 8 to 10 
weeks and for masked TMS320C2x production 12 to 16 weeks. Texas In- 
struments constantly strives to improve these leadtimes and reserves the right 
to make changes at any time. Please contact the nearest Tl Sales Office for 
current leadtimes, further information on these procedures, and confirmation 
of the mask/production requirements. 


A TMS320C2x ROM code may be submitted in one of the following formats 
(the preferred media is 5 1/4” floppies): 


FLOPPY: TI Cross-Assembler Format 
EPROM: TMS27C64, TMS2508, TMS2516, TMS2532, TMS2564 
PROM: TBP28S166, TBP28S86 


When a code is submitted to Texas Instruments for a masked device, the code 
is reformatted to accommodate the Tl mask generation system. System level 
verification by the customer is therefore necessary. Although the code has 
been reformatted, it is important that the changes remain transparent to the 
user and not affect the execution of the algorithm. The formatting changes 
made involve deletion of all address tags (unnecessary in a ROM code device) 
and addition of data in the reserved locations of the ROM for device ROM test. 
Note that because these changes have been made, a checksum comparison is 
not a valid means of verification. 


ROM code algorithms may also be submitted by secure electronic transfer via 
a modem. Contact the nearest TI sales office for further information. 


With each masked device order, the customer must sign a disclaimer stating: 


"The units to be shipped against this order were assembled, for 
expediency purposes, on a prototype (i.e., non-production qualified) 
manufacturing line, the reliability of which is not fully characterized. 
Therefore, the anticipated inherent reliability of these prototype units 
cannot be expressly defined.” 


and a release stating: 
“Any masked ROM device may be resymbolized as T! standard prod- 
uct and resold as though it were an unprogrammed version of the 
device at the convenience of Texas Instruments.” 


ROM codes will be deleted from the TI system after one year from the last 
delivery. 


H. Quality and Reliability 


The quality and reliability performance of Texas Instruments Microprocessor 
and Microcontroller Products, which includes the three generations of 
TMS320 digital signal processors, relies on feedback from: 


e Our customers 


e Our total manufacturing operation from front-end wafer fabrication to 
final shipping inspection 


e Product quality and reliability monitoring. 


Our customer’s perception of quality must be the governing criterion for 
judging performance. This concept is the basis for Texas Instruments Corpo- 
rate Quality Policy, which is as follows: 


"For every product or service we offer, we shall define the require- 
ments that solve the customer’s problems, and we shall conform to 
those requirements without exception.” 


Texas Instruments offers a leadership reliability qualification system, based on 
years of experience with leading-edge memory technology as well as years of 
research into customer requirements. Quality and reliability programs at T! are 
therefore based on customer input and internal information to achieve con- 
stant improvement in quality and reliability. 
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H.1 Reliability Stress Tests 


H-2 


Accelerated stress tests are performed on new semiconductor products and 
process changes to ensure product reliability excellence. The typical test en- 
vironments used to qualify new products or major changes in processing are: 


High-temperature operating life 

Storage life 

Temperature cycling 

Biased humidity 

Autoclave 

Electrostatic discharge 

Package integrity 

Electromigration 
Channel-hot electrons (performed on geometries less than 2.0 um). 


Typical events or changes that require internal requalification of product in- 
clude: 


@ New die design, shrink, or layout 


e Wafer process (baseline/control systems, flow, mask, chemicals, gases, 
dopants, passivation, or metal systems) 


© Packaging assembly (baseline contro! systems or critical assembly 
equipment) 

@ Piece parts (such as lead frame, mold compound, mount material, bond 
wire, or lead finish) 


@ Manufacturing site. 


TI reliability control systems extend beyond qualification. Total reliability 
controls and management include product ramp monitor as well as final pro- 
duct release controls. MOS memories, utilizing high-density active elements, 
serve as the leading indicator in wafer-process integrity at Tl MOS fabrication 
sites, enhancing all MOS logic device yields and reliability. Tl places more 
than 200,000 MOS devices per month on reliability test to ensure and sustain 
built-in product excellence. 


Table H-1 lists the microprocessor and microcontroller reliability tests, the 
duration of the test, and sample size. The following defines and describes 
those tests in the table. 


AOQQ (Average Outgoing Quality) Amount of defective product in a pop- 
ulation, usually expressed in terms of 
parts per million (PPM). 


FIT (Failure In Time) Estimated field failure rate in number 
of failures per billion power-on device 
hours; 1000 FITS equals 0.1 percent 
fail per 1000 device hours. 


Operating lifetest Device dynamically exercised at a high 
ambient temperature (usually 125°C) 
to simulate field usuage that would 
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High-temperature storage 


Biased humidity 


Autoclave (pressure cooker) 


Temperature cycle 


Thermal shock 


PIND 


Mechanical Sequence: 
Fine and gross leak 
Mechanical shock 


PIND (optional) 
Vibration, variable frequency 


Constant acceleration 


Fine and gross leak 


expose the device to a much lower 
ambient temperature (such as 55°C). 
Using a derived high temperature, a 
55°C ambient failure rate can be cal- 
culated. 


Device exposed to 150°C unbiased 
condition. Bond integrity is stressed in 
this environment. 


Moisture and bias used to accelerate 
corrosion-type failures in plastic 
packages. Conditions include 85°C 
ambient temperature with 85-percent 
relative humidity (RH). Typical bias 
voltage is +5 V and ground on alter- 
nating pins. 


Plastic-packaged devices exposed to 
moisture at 121°C using a pressure of 
One atmosphere above normal pres- 
sure. The pressure forces moisture 
permeation of the package and accel- 
erates corrosion mechanisms (if pres- 
ent) on the device. External package 
contaminates can also be activated 
and caused to generate inter-pin cur- 
rent leakage paths. 


Device exposed to severe temperature 
extremes in an alternating fashion 
(-65°C for 15 minutes and 150°C for 
15 minutes per cycle) for at least 1000 
cycles. Package strength, bond qual- 
ity, and consistency of assembly pro- 
cess are stressed in this environment. 


Test similar to the temperature cycle 
test, but involving a liquid-to-liquid 
transfer, per MIL-STD-883C, Method 
1011. 


Particle Impact Noise Detection test.: 
A non-destructive test to detect loose 
particles inside a device cavity. 


Per MIL-STD-883C, Method 1014.5 
Per MIL-STD-883C, Method 2002.3, 
1500 g, 0.5 ms, Condition B 

Per MIL-STD-883C, Method 2020.4 
Per MIL-STD-883C, Method 2007.1, 
20 g, Condition A 

Per MIL-STD-883C, Method 2001.2, 
20 kg, Condition D, Y1 Plane min 
Per MIL-STD-883C, Method 1014.5 
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Electrical test 


Thermal! Sequence: 
Fine and gross leak 
Solder heat (optional) 
Temperature cycle 

(10 cycles minimum) 
Thermal shock 

(10 cycles minimum) 
Moisture resistance 
Fine,and gross leak 
Electrical test 


Thermal/Mechanical Sequence: 
Fine and gross leak 
Temperature cycle 
(10 cycles minimum) 
Constant acceleration 


Fine and gross leak 
Electrical test 


Electrostatic discharge 
Solderability 

Solder heat 

Salt atmosphere 

Lead pull 


Lead integrity 


Electromigration 


’ Resistance to solvents 


To data sheet limits 


Per MIL-STD-883C, Method 1014.5 
Per MIL-STD-750C, Method 1014.5 
Per MIL-STD-883C, Method 1010.5, 
-65 to +150°C, Condition C 

Per MIL-STD-883C, Method 1011.4, 
-55 to +125°C, Condition B 

Per MIL-STD-883C, Method 1004.4 
Per MIL-STD-883C, Method 1014.5 
To data sheet limits 


Per MIL-STD-883C, Method 1014.5 
Per MIL-STD-883C, Method 1010.5, 
-65 to +150°C, Condition C 

Per MIL-STD-883C, Method 2001.2, 
30 kg, Y1 Plane 

Per MIL-STD-883C, Method 1014.5 
To data sheet limits 


Per MIL-STD-883C, Method 3015 
Per MIL-STD-883C, Method 2003.3 
Per MIL-STD-750C, Method 2031, 
10 sec 

Per MIL-STD-883C, Method 1009.4, 
Condition A, 24 hrs min 

Per MIL-STD-883C, Method 2004.4, 
Condition A 

Per MIL-STD-883C, Method 2004.4, 
Condition B1 

Accelerated stress testing of con- 
ductor patterns to ensure acceptable 
lifetime of power-on operation 

Per MIL-STD-883C, Method 2015.4 
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Table H-1. Microprocessor and Microcontroller Tests 


TEST DURATION SAMPLE SIZE 
PLASTIC CERAMIC 


Operating life, 125°C, 5.0 V 1000 hrs 
Operating life, 150°C, 5.0 V 1000 hrs 
Storage life, 150°C 1000 hrs 
Biased 85°C/85 percent RH, 5.0 V 1000 hrs 
Autoclave, 121°C, 1 ATM 240 hrs 

Temperature cycle, -65 to 150°C 1000 cyc 
Thermal shock, -65 to 150°C 500 cyc 
Electrostatic discharge, +2 kV 

Latch-up (CMOS devices only) 

Mechanical sequence 

Thermal sequence 

Thermal/mechanical sequence 

PIND 

Internal water vapor 

Solderability 

Solder heat 

Resistance to solvents 

Lead integrity 

Lead pull 

Lead finish adhesion 

Salt atmosphere 

Flammability (UL94-VO) 

Thermal impedance 


*If junction temperature does not exceed plasticity of package. 
{ 
e 


Table H-2 provides a list of the TMS320C2x devices, the approximate number 
of transistors, and the equivalent gates. The numbers have been determined 
from design verification runs. 


Table H-2. TMS320C2x Transistors 


DEVICE ‘ # TRANSISTORS | # GATES 
NMOS: TMS32020 Ceo 
CMOS: TMS320C25 160K | 40K 


Tl Qualification test updates are available upon request at no charge. T! will 
consider performing any additional reliability test(s), if requested. For more 
information on TI quality and reliability programs, contact the nearest T! field 
sales office. 
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Note: 


Texas Instruments reserves the right to make changes in MOS Semicon- 
ductor test limits, procedures, or processing without notice. Unless prior 


arrangements for notification have been made, TI advises all customers to 
reverify current test and manufacturing conditions prior to relying on 
published data. 
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BACC 4-36 
Branch Unconditionally 
B 4-35 
branches 3-25, 3-37, 5-28, C-4 
bulletin board E-10 
burst-mode operation 3-61 
BV 5-34 
Branch on Overflow 4-60 
BZ 
Branch if Accumulator Equals 
Zero 4-61 


C 
C compiler E-5 
CALA 5-7 

Call Subroutine Indirect 4-62 
CALL 5-7 

Call Subroutine 4-64 
Call Subroutine 

CALL 4-64 
Call Subroutine Indirect 

CALA 4-62 
calls 5-28, C-4 
carry bit (C) 3-25, 3-43, 5-50, C-6 
central arithmetic logic unit (CALU) 3-8, 

3-22 

CLKOUT1 2-6, 3-44, 3-48 
CLKOUT2 2-6, 3-48 
CLKR 2-6, 3-56, C-7 
CLKX 2-6, 3-56, C-7 
clock phases 3-48, C- 
clock timing 3-44, 3- 
CMPL 


Complement Accumulator 4-66 
CMPR 


5 
8 


index 


Compare Auxiliary Register with Aux- 
iliary Register ARO 4-67 
CNF 3-43 
CNFD 3-13, 5-25, C-5 
Configure Block as Data 
Memory 4-68 
CNFP 3-13, 5-25, C-5 
Configure Block as Program 
Memory 4-69 
codec interface 6-34 
codecs (T!) F-2 
companding 5-60 
A-law/p-law 5-60 
sign-magnitude data 5-60 
two's-complement data 5-60 
Complement Accumulator 
CMPL_ 4-66 
computed GOTO 5-14 
Conditional Subtract 
SUBC 4-171 
Configure Block as Data Memory 
CNFD 4-68 
Configure Block as Program Memory 
CNFP 4-69 
context switching 5-16 
continuous-mode operation 3-62, 3-64, 
3-66 
control system 6-47 
convolution 5-36 
crystal oscillator circuit 6-5 
Crystals F-33 
cycle timings (instructions) C-4, D-1 


D 


D/A converters (Tl) F-2 

D/A interface 6-39 

data address bus (DAB) 3-8, 3-19 

data bus (D15-D0O) 2-4, 3-8 

data memory 3-11 

data memory addressing 3-19 

data memory expansion 3-11 

data memory page pointer (DP) 3-8, 
3-20, 3-43 

Data Move in Data Memory 

DMOV 4-71 

data moves 3-20, 5-35 

data receive register (DRR) 3-9, 3-16, 
3-56, C-7 

data transmit register (DXR) 3-9, 3-16, 
3-56 

decode (pipeline) 3-29 

denormalization 5-47 


development support E-1 
analog interface board (AIB) £-7 
C compiler E£-5 
DFDP (digital filter design 
package) E-9 
DSP Software Library E-9 
emulator (XDS)_  E-5 
macro assembler/linker E-3 
simulator E-4 
SoftWare Development System 
(SWDS) E-4 
TMS320 Design Kit E-8 
TMS320 DSP bulletin board 
service E-10 
TMS320 DSP hotline £-10 
XDS/22 upgrade’ E-7 
digital filter design package 


(DFDP) E-9 
digital filters 5-61 
DINT 


Disable Interrupt 4-70 
direct address bus (DRB) 3-8, 3-19 
direct addressing mode 3-19, 4-2, C-2 
direct memory access (DMA) 3-71, 6-29 
Disable Interrupt 

DINT 4-70 
disk drives 6-47 
division 5-42 
DMOV_ 3-20, 5-35 

Data Move in Data Memory 4-71 
DR 2-6, 3-56 
DS- 2-4, 3-13 
DSP Software Library E-9 
DX 2-6, 3-56 


E 


echo cancellation 6-45 
EINT 3-53, 3-54 

Enable Interrupt 4-73 
electrical specifications A-1 
emulator (XDS) _ £E-5 
Enable Interrupt 

EINT 4-73 
EPROM interfacing 6-19 
EPROM memories (T!) F-2 
EXAMPLE 

Example Instruction 4-18 
Example Instruction 

EXAMPLE 4-18 
Exclusive-OR with Accumulator 

XOR 4-184 
execute (pipeline) 3-29 
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Index 


extended-precision arithmetic 5-49 
external clock (CLKX) 3-56 
external flag (XF) 3-44, 3-50 
external memory interface 3-47 
external program/data access 3-40 


F 


Fast Fourier Transforms (FFT) 5-68 
‘fetch (pipeline) 3-29 
filtering 5-61 
FIR filters 5-61 
fixed-point conversion 4-116 
floating-point arithmetic 3-24, 5-45 
floating-point conversion 3-27, 4-79, 
5-45 
format bit (FO) 3-43, 3-56 
Format Serial Port Registers 
FORT 4-74 
FORT C-7 
Format Serial Port Registers 4-74 
frame sync pulses 3-56, 3-60, 3-62 
frame synchronization mode bit 
(FSM) 3-43, 3-56, C-6 
framing contro! 3-60 


FSR 2-6, 3-56 
FSX 2-6, 3-56 
G 

gates H-5 


global memory 3-69, 6-32 

global memory allocation register 
(GREG) 3-8, 3-16, 3-69. 3-70, 6-32 

graphics 6-47 


H 


hardware applications 6-1 
direct memory access (DMA) 6-29 
disk drives 6-47 
echo cancellation 6-45 
global memorv 6-32 
graphics 6-47 
higti-speed control 6-47 
high-speed modem 6-45 
image processing 6-47 
instrumentation 6-48 
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interfacing memories 6-10 
interfacing peripherals 6-34 
numeric processing 6-48 
robotics 6-47 
system applications 6-45 
system control circuitry 6-3 
user target design using XDS__ 6-7 
voice coding 6-46 

hardware stack 3-10, 3-28, 5-9, 5-10, 

5-16 

Harvard architecture 1-3 

HOLD- 2-5, 3-39, 3-53, 3-71 

hold mode (HM) 3-43, C-6 

HOLDA- 2-5, 3-39, 3-71 

hotline £-10 


1/0 interface 3-47, 6-42 
1/O port addressing 3-47, 6-42 
IACK- 2-5, 3-52, 3-54 
IDLE 3-71, C-4 
Idle Until Interrupt 4-75 


Idle Until Interrupt 


IDLE 4-75 
IR filters 5-61 
image processing 6-47 
immediate addressing mode 3-19, 4-9 
IN 3-47, 5-23 

Input Data from Port 4-76 
indexed addressing 5-49 
indirect addressing mode 3-19, 4-4, 5-35 
initiatzation 3-66, 5-2 
Input Data from Port 

IN 4-76 
instruction cycle timings D-1 
instruction pipeline 3-29 
instruction register (IR) 3-8, 3-30 
instruction set summary 4-12 
instructions (assembly language) 4-1 
instrumentation 6-48 
interface timing analysis 6-27 
interfacing memories 6-10 
interfacina peripherals 6-34 
internal hardware summary 3-7 
interrupt acknowledge (IACK) 3-52, 

3-54 
interrupt flag register (IFR) 3-8, 3-52, 
-16 


interrupt mask register (IMR) 3-8, 3-16, 
3-52, 5-16 

interrupt mode (INTM) 3-43, 3-52, 3-53, 
3-54 


Index 


interrupts 2-5, 3-39, 3-52, 3-54, 3-71, 
5-16, C-4 
external interrupt interface 3-53 
operation 3-52 
priorities 3-52, 5-22 
service routine 5-16 
vector locations 3-52, 5-16 
IS- 2-4, 3-13 


K 


key features 1-4 


LAC 
Load Accumulator with Shift 4-77 
LACK 
Load Accumulator Immediate 
Short 4-78 
LACT 3-24, 5-45 
Load Accumulator with Shift Specified 
by T Register 4-79 
LALK 
Load Accumulator Long Immediate 
with Shift 4-81 
LAR 
Load Auxiliary Register 4-82 
LARK 
Load Auxiliary Register Immediate 
Short 4-84 
LARP 
“Load Auxiliary Register Pointer 4-85 
LDP 
Load Data Memory Page 
Pointer 4-86 
LDPK 
Load Data Memory Page Pointer Im- 
mediate 4-87 
lett shifts © 3-27, 5-31 
linker E-3 
Load Accumulator Immediate Short 
LACK 4-78 
Load Accumulator Long Immediate with 
Shift 
LALK 4-81 
Load Accumulator with P Register 
PAC 4-121 


Load Accumulator with Shift 
LAC 4-77 
Load Auxiliary Register 
LAR 4-82 
Load Auxiliary Register Immediate Short 
LARK 4-84 
Load Auxiliary Register Long Immediate 
LRLK 4-89 
Load Auxiliary Register Pointer 
LARP 4-85 
Load Data Memory Page Pointer 
LDP 4-86 
Load Data Memory Page Pointer Immedi- 
ate 
LDPK 4-87 
Load High P Register 
LPH 4-88 
Load Status Register STO 
LST 4-90 
Load Status Register ST1 
LST1 4-92 
Load T Register 
LT 4-94 
Load T Register and Accumulate Previous 
Product 
LTA 4-95 
Load T Register, Subtract Previous Product 
LTS 4-100 
logical operations 5-31 
loop control 5-13 


LPH 
Load High P Register 4-88 
LRLK 
Load Auxiliary Register Long Immedi- 
ate 4-89 
LST 


Load Status Register STO 4-90 
LST1 
Load Status Register ST1 4-92 
LT 
Load T Register 4-94 
LTA 5-39 
Load T Register and Accumulate Pre- 
vious Product 4-95 
LTD 5-39 
Load T Register, Accumulate Previous 
Product, and Move Data 4-97 
LTP 
Load T Register and Store P Register 
in Accumulator 4-99 
LTS 
Load T Register, Subtract Previous 
Product 4-100 
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M 


MAC _ 5-37, 5-38, C-4 
Multiply and Accumulate 4-101 
MACD _ 5-37, 5-38, C-4 
Multiply and Accumulate with Data 
Move 4-104 
macro assembler E-3 
MAR 
Modify Auxiliary Register 4-107 
mask options 3-12, G-1 
memory 3-11, 5-23, C-2 
addressing modes 3-19, 4-2 
auxiliary registers 3-16 
block moves 5-23 
data memory 3-11 
global memory 3-69, 6-32 
memory expansion 3-11, 3-12 
memory maps 3-13 
memory-mapped registers 3-16 
program memory 3-12 
memory addressing modes 3-19, 4-2 
direct addressing 3-19, 4-2, C-2 
immediate addressing 3-19, 4-9 
indirect addressing 3-19, 4-4 
memory combinations 3-47 
‘memory interface 6-10 
memory management 5-23 
memory maps 3-13 
memory products (Tl) F-2 
memory-mapped registers 3-13, 3-16 


microcall stack (MCS) register 3-9, 3-30 


microcomputer mode 2-5, 3-12, 3-15 
microprocessor mode 2-5, 3-12, 3-15 
modem 6-45 
Modify Auxiliary Register 
MAR 4-107 
MP/MC- 2-5, 3-15 
MPY 5-38, C-3 
Multiply 4-109 
MPYA 
Multiply and Accumulate Previous 
Product (TMS320C25) 4-110 
MPYK C-3 
Multiply Immediate 4-111 
MPYS 
Multiply and Subtract Previous Prod- 
uct (TMS320C25) 4-112 
MPYU 3-26 | 
Multiply Unsigned 
. (TMS320C25) 4-113 
-MS/PC-DOS__ E-9 
MSC- 2-5, 6-16 
multiplexed external data bus 3-35 
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multiplication 5-37, 5-54, C-3 
multiplier 3-9, 3-26, 5-37 
Multiply 
MPY 4-109 
Multiply and Accumulate 
MAC 4-101 
Multiply and Accumulate with Data Move 
MACD 4-104 
Multiply Immediate 
MPYK 4-111 
Multiply Unsigned (TMS320C25) 
MPYU 4-113 
multiprocessing 3-68 


N 


NEG 
Negate Accumulator 4-114 
Negate Accumulator 
NEG 4-114 
No Operation 
NOP 4-115 
nomenclature E-14 
NOP 
No Operation 4-115 
NORM _ 3-24, 5-45, C-4 
Normalize Contents of 
Accumulator 4-116 
normalization 5-45, 5-47 
Normalize Contents of Accumulator 
NORM 4-116 
numeric processing 6-48 


O 


on-chip program RAM execution 5-28 

on-chip program ROM 3-14 

on-chip RAM 3-9, 3-13, 5-25 

on-chip RAM configuration centre: bit 
(CNF) 3-43 

OR 


OR with Accumulator 4-118 
OR Immediate with Accumulator with Shift 
ORK 4-119 
OR with Accumulator 
OR 4-118 
ordering information E-11 
ORK 
OR Immediate with Accumulator with 
Shift 4-119 


Index 


oscillator circuit 6-5 
OUT 3-47, 5-23 
Output Data to Port 4-120 
Output Data to Port 
OUT 4-120 
overflow flag (OV) 3-43, 5-34 
overflow managment 5-34 
overflow mode (OVM) 3-44, 5-31, 5-34 
overflow saturation mode 3-24 


P 


P register (PR) 3-25. 3-26, 5-37 
PAC 


Load Accumulator with P 
Register 4-121 
part numbers” E-11 
PC stack 5-7 
PC/MS-DOS_ E-9 
period register (PRD) 
5-11, C-5 
peripheral interface 6-34 
PID control 5-75 
pinouts 2-2 
pipeline operation 3-29 
decode 3-29 
execute 3-29 
fetch 3-29 
prefetch 3-29 
three-level pipeline 3-29 
two-level pipeline 3-29 
wait states 3-29 
PM bits 3-44. 5-42, C-2 
POP 3-29 
Pop Top of Stack to Low 
Accumulator 4-122 
Pop Top of Stack to Data Memory 
POPD 4-124 
Pop Top of Stack to Low Accumulator 
POP 4-122 
POPD 3-29,5-10 
Pop Top of Stack to Data 
Memory 4-124 
powerdown mode _ 3-46 
powerup reset circuit 6-3 
prefetch (pipeline) 3-29 
prefetch counter (PFC) 3-9, 3-30 
product quality/reliability H-1 
product register (PR) 3-9, 3-26, 5-37 
product shift mode(PM) bits 3-27, 3-44, 
5-32, 5-42, C-6 
program access (on-chip) 3-40 
program address bus (PAB) 3-9 


3-9, 3-16, 3-44, 


program bus 3-9 
program counter (PC) 
program memory 3-12 
program memory expansion 3-12 
PROM interfacing 6-11 
prototype devices G-1 
PS- 2-4, 3-13 
PSHD 3-29, 5-10 
Push Data Memory Value onto 
Stack 4-126 
PUSH. 3-29 
Push Low Accumulator onto 
Stack 4-128 


3-9, 3-28, 3-37 


Push Data Memory Value onto Stack 


PSHD 4-126 
Push Low Accumulator onto Stack 
PUSH 4-128 


0] 


quality/reliability H-1 
queue instruction register (QIR) 3-9, 
0 


Q15 format 3-27, 5-45 


R 

R/W- 2-4 

RAM interfacing 6-24 
RC 


Reset Carry Bit (TMS320C25) 4- 
129 


READY 2-4, 3-70 
receive framing synchronization signal 
(FSR) 3-56 
receive shift register (RSR) 3-9, 3-56 
received serial data (DR) 3-56 
reliability tests H-2 
repeat counter (RPTC) 
3-53, 5-13 
reset (RS-) 2-5, 3-39, 3-41, 3-52, C-5 
Reset Carry Bit (TMS320C25) 
RC 4-129 
reset circuit 6-3 
Reset External Flag 
RXF 4-142 | 
Reset Hold Mode (TMS320C25) 
RHM 4-132 
Reset Overflow Mode 
ROVM 4-135 


3-9, 3-29, 3-46, | 
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Reset Serial Port Transmit Mode 
RTXM 4-141 
Reset Sign-Extension Mode 
RSXM 4-139 
Reset Test 
Control! Flag (TMS320C25) 
RTC 4-140 
RET 3-37, 3-53, 5-7 
Return from Subroutine 4-130 
Return from Subroutine 
RET 4-130 
reverse-carry (rc) propagation 3-18, 4-5, 
4-7, 5-68 
RFSM 
Reset Serial Port Frame Synchroniza- 
tion Mode (TMS320C25) 4-131 
RHM 
Reset Hold Mode 
(TMS320C25) 4-132 
right shift 3-27, 5-31 
RINT 3-52, C-7 
robotics 6-47 
ROL 
Rotate Accumulator Left 
(TMS320C25) 4-133 
ROM codes G-1 
ROR 
Rotate Accumulator Right 
(TMS320C25) 4-134 
Rotate Accumulator Left (TMS320C25) 
ROL 4-133 
Rotate Accumulator Right (TMS320C25) 
ROR 4-134 
ROVM _ 3-24, 5-31, 5-34 
Reset Overflow Mode 4-135 
RPT 3-46, 3-53, 5-13 
Repeat Instruction as Specified by 
Data Memory Value 4-136 
RPTC 3-29 
RPTK 3-46, 3-53, 5-13 
Repeat Instruction as Specified by Im- 
mediate Value 4-138 
RSXM_ 5-31 
Reset Sign-Extension Mode 4-139 
RTC 
Reset Test 
Control Flag (TMS320C25) 4- 
140 
RTXM 
Reset Serial Port Transmit Mode 4- 
141 
RXF 3-50 
Reset External Flag 4-142 
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SACH C-4 
Store High Accumulator with 
Shift 4-143 
SACL 3-33, C-4 
Store Low Accumulator with 
Shift 4-144 
SAR 
Store Auxiliary Register 4-145 
SBLK 
Subtract from Accumulator Long Im- 
mediate with Shift 4-147 
SBRK 
Subtract from Auxiliary Register Short 
Immediate (TMS320C25) 4-148 
SC 
Set Carrv Rit (TMS320C25) 4-149 
scaling 5-35 
scaling shifter 3-23 
serial port 3-56, C-6 
burst-mode operation 3-61 
continuous-mode operation 3-62, 
3-64, 3-66 
timing and framing control 3-60 
transmit/receive operations 3-58 
serial-port clock (CLKR) 3-56 
Set Carry Bit (TMS320C25) 
SC 4-149 
Set External! Flag 
SXF 4-178 | 
Set Hold Mode (TMS320C25) 
SHM 4-154 
Set Overflow Mode 
SOVM 4-155 
Set P Register Output Shift Mode 
SPM 4-159 
Set Serial Port Transmit Mode 
STXM 4-168 
Set Sign-Extension Mode 
SSXM 4-166 
Set Test 
Control Flag (TMS320C25) 
STC 4-167 
SFL 3-206, 5-35 
Shift Accumulator Left 4-150 
SFR 3-26, 5-34 
Shift Accumulator Right 4-151 
SFSM 
Set Serial Port Frame Synchronization 
Mode (TMS320C25) 4-153 
Shift Accumulator Left 
SFL 4-150 
Shift Accumulator Right 


Index 


SFR 4-151 
shift modes 3-27, 3-44, 5-31, 5-42 
snifters 3-10, 5-34 
accumulator 3-23 
accumulator output 3-23, 5-34 
product register output 3-23, 5-34 
scaling shifter 3-23 
SHM 
Set Hold Mode (TMS320C25) 4- 
154 
signal descriptions 2-1 
sign-extension mode 5-31 
sign-extension mode bit (SXM) 3-26, 
3-44, 5-31, C-6 
sign-magnitude data 5-31, 5-60 
simulator E-4 
single-instruction loops 5-13 
sockets (Tl) F-28 
software applications 5-1 
SoftWare Development System 


(SWDS) E-4 
Software Interrupt 
TRAP 4-183 


software library E-9 
software stack 5-10 
software stack expansion 5-10 
SOVM | 3-24, 5-31, 5-34 
Set Overflow Mode 4-155 
SPAC 
Subtract P Register from 
Accumulator 4-156 
specifications A-1 
SPH 
Store High P Register 
(TMS320C25) 4-157 
SPL 
Store Low P Register 
(TMS320C25) 4-158 
SPM__ 5-32, 5-34 
‘Set P Register Output Shift 
Mode 4-159 
SQRA 3-26, 5-41 
Square and Accumulate Previous Pro- 
duct 4-160 
SQRS_~ 3-26, 5-41 
Square and Subtract Previous 
Product 4-161 
Square and Accumulate Previous Product 
SQRA 4-160 
Square and Subtract Previous Product 
SOQRS 4-161 
square-root routine 5-7 
SST 
Store Status Register STO 4-162 
SST1 


Store Status Register ST1 4-164 
SSXM 5-31 

Set Sign-Extension Mode 4-166 
stack 3-28, 5-7, 5-9 
static RAM intertacing 6-24 
status registers 3-10, 3-42, 5-31, C-2, 
STC 

Set Test 

Control Flag (TMS320C25) 4- 
167 


Store Auxiliary Register 
SAR 4-145 
Store High Accumulator with Shift 
SACH 4-143 
Store High P Register (TMS320C25) 
SPH 4-157 
Store Low Accumulator with Shift 
SACL 4-144 
Store Low P Register (TMS320C25) 
SPL 4-158 
Store Status Register STO 
SST 4-162 
Store Status Register ST1 
SST1 4-164 
STRB- 2-4 
STXM 
Set Serial Port Transmit Mode 4-168 
SUB 
Subtract from Accumulator with 
Shift 4-169 
SUBB 
Subtract from Accumulator with Bor- 
row (TMS320C25) 4-170 
SUBC 5-42, C-3 
Conditional Subtract 4-171 
SUBH 
- Subtract from High Accumulator 4- 
173 
SUBK 
Subtract from Accumulator Short Im- 
mediate (TMS320C25) 4-174 
subroutines 5-7 
SUBS 
Subtract from Low Accumulator with 
Sign-Extension Suppressed 4-175 
SUBT 3-24 
Subtract from Accumulator with Shift 
Specified by T Register 4-176 
Subtract from Accumulator with Shift 
SUB 4-169 
Subtract from High Accumulator 
SUBH 4-173 
Subtract P Register from Accumulator 
SPAC 4-156 
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subtraction 3-25, 5-35, 5-52 
SXF 3-50 

Set External Flag 4-178 
SXM_ 5-31, C-2 
SYNC- 2-5, 3-68 
synchronization 3-68, C-5 
system applications 6-45 
system control 3-28 
system control circuitry 6-3 
system migration C-1 


T 


T register (TR) 
Table Read 
TBLR 4-179 
Table Write 
TBLW 4-181 
TBLR 3-21, 5-23 
Table Read 4-179 
TBLW) 3-21, 5-23 
Table Write 4-181 
temporary register (TR) 
Test Bit 
BIT 4-45 
Test Bit Specified by T Register 
BITT 4-47 
test control flag bit (TC) 3-44 
timer 3-10, 3-44, 5-11, C-5 
timer interrupt (TINT) 3-45, 3-52, 5-11 
timer register (TIM) 3-16, 3-44, 5-11 
timing analysis for interfacing 6-27 
timing control 3-60, 5-11 
TMS320 Design Kit E-8 
TMS320 development tool 
nomenclature E-14 
TMS320 device nomenclature E-13 
TMS320 DSP bulletin board service E-10 
TMS320 DSP hotline E-10 
TMS320C1x to TMS32020 system mi- 
gration C-2 
TMS320C25 1-3 
TMS32020 1-3 
TMS32020 to TMS320C25 system mi- 
gration C-4 
transistors H-5 
transmit framing synchronization signal 
(FSX) 3-56 
transmit mode bit (TXM) 
transmit shift register (XSR) 


3-26, 5-37 


3-1 O, 3-26, 5-37 


3-44, 3-56 
3-10, 3-56 
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transmitted serial data (DX) 3-56 
TRAP 3-52 
Software Interrupt 
two’s-complement data 
5-45, 5-60 
two-word instructions 3-36 


4-183 
5-32, 5-38, 


U 


user target design using XDS__ 6-7 


V 


VAX/IVMX_ E-9 
VCC 2-6 

voice coding 6-46 
VSS 2-6 


Ww 


wait states 3-34, 6-16 
wait-state generator 6-16 


Xx 


XDS design considerations 6-7 
XDS emulator E-5 
XDS/22 upgrade’ E-7 
XF 2-5, 3-44, 3-50 
XINT 3-52, C-7 
XOR 
Exclusive-OR with Accumulator 4- 
184, 
XOR Immediate with Accumulator with 
Shift 
XORK 4-185 
XORK 
XOR Immediate with Accumulator with 
Shift 4-185 
XSR_ 3-56 
X1 2-6 


X2/CLKIN 2-6 
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Z 


ZAC 
Zero Accumulator 4-186 
ZALH 
Zero Low Accumulato: and Load High 
Accumulator 4-187 
ZALR 


Zero Low Accumulator, Load High 
Accumulator with Rounding 
(TMS320C25) 4-188 

ZALS 

Zero Accumulator, Load Low Accu- 
mulator with Sign-Extension Sup- 
pressed 4-189 

Zero Accumulator 

ZAC 4-186 
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Instruction Symbols 


Auxiliary register 
Auxiliary register pointer 
Bit code 
Branch address 
Data memory address or indirect addressing 
control bits (see below) 
Data memory address 
Indirect/direct addressing mode 
1 = indirect; 0 = direct addressing 
Indirect address: {*|'+|*-|"O+|*O-} for ‘20; 
{*]"+]*-]°0+]"0-|*BRO+|*BRO-} for ’C25 
Immediate value 
Port address 
Program memory address 
Shift count 
User-defined items 
Optional! items 


Status Register STO Bits 
15-13 12 11 10 9 876 5 4 3 2 1090 


PARP | ovjovm[ 1 [nrm[ OP 
Status Register ST1 Bits 


| ARB | CNF| TC[sxm{ C [7 1|HMjFSM KF [FO[TXM] PM | 


NOTE: On the TMS32020, bits 5,6, and 9 of ST1 are one’s. 


ARP Auxiliary register pointer 
OV Accumulator overflow flag bit 
OVM Overflow mode bit 


INTM Interrupt mask bit 

DP Data memory page pointer 

ARB Auxiliary register pointer buffer 
CNF On-chip RAM configuration control bit 
TC Test/control flag bit 

SXM ‘Sign-extension mode bit 

Cc Carry bit 

HM Hold mode bit 

FSM Frame synchronization mode bit 
XF XF pin status bit 

FO Format bit 

TXM Transmit mode bit 

PM Product shift mode bits 


Instruction Set Summary 


ADDC 
ADDH 


ADDK 


tCycles using full-speed, on-chip, external program memory. 


Tl Customer Response Center (CRC) Hotline: 
(800) 232-3200 


TMS320 DSP Hotline: 
(713) 274-2320 


TMS320 DSP Bulletin Board Service: 
(713) 274-2323 


Instruction Format Description 
1514131211109 8 7 6 5 4 3 2 


(e) 
uv 
QO 
{e) 
oO 
m 
n 
~ 
: 
a 
M 


ie) 
4 
Q 
o 
Oo 
m™ 


OPCODE 
OPCODE 
OPCODE 
OPCODE 
OPCODE 
OPCODE 
OPCODE 


OPCODE 


OPCODE OPCODE 


Indirect Addressing Control Bits 


6 5 4 3 2 1 0 
IDV Increment/decrement value 
INC Increment flag; 1 increments auxiliary register 


DEC Decrement flag; 1 decrements auxiliary register 
NAR _ New auxiliary register control bit; 1 loads new ARP 
ARP _ Auxiliary register pointer 


<dma>[,<shift>] 
<ind>[,<shift> {.<next ARP>]] 


<dma>; <ind>{,<next ARP>] 
<dma>; <ind>[,<next ARP>] 


<dma> 
<ind>([,<next ARP>] 
<dma> 
<ind>[,<next ARP>] 


rh 


peel 


Instruction Set Summary (Concluded 


RSXM 

RTC 

RTXM 
[Reset external flag 


Opcode 
CEO6 


Operand Options 


R Reset external flag 


SACH Store high accumulator with shift 
SA Store auxiliary register [ses 


1/1 
1/1 
SBLK Subtract from accumulator long 2/2 
1/1 


<dma>[,<shift>] 
<ind>[(,<shift> [<next ARP>]] 


<dma> 6000 
<ind> {,<shift> [<next ARP>]} 


<AR>,<dma> 
<AR>,<ind>[>,<next ARP>] 


<constant> [,<shift>] D003 


7000 


16 


immediate with shift 


SBRK Subtract from auxiliary register 
short immediate 


<constant> 7FOO 10 


N 


S 1/1 
S 


= 
~ 
= 


A 
° o 
3 3 
oO o 


ee E18 

E19 
mode 

N 


SPAC Subtract P register from accumulator } 11 | 


STIQLOLO QFOLaQIAa DIAIOLA 
O}mimim m m o}mimtirn 
Of—]jo]w Ww wo SIOINIw 
Ola] wlio SI = SIOLOIN 


L 
R 
SPH dma>; <ind> [,<next ARP>] 
<dma>; <ind> [,<next ARP>] 
<constant> 
dma>; <ind> [,<next ARP>] 
dma>; <ind> [,<next ARP>] 
SST <dma>; <ind> [,<next ARP>] 
Cc 


| 
B 
Bi 
| 
f 
| 
| 
f 
i 
a 
| 
| 
| 
| 
E 
SI 
eB 
SST1__| Store status register ST) | 1/1 | <dma>; <ind> [/<next ARP>] 
a 
a 
f 
| 
li 
| 
a 
E 
i 


SPL 


~ 
io) 
fo] 
oO 


Store low P register 


xX 
Cc 
F 
F 
H 
P 


Set P register output shift mode 


QO 


(?) BIO alaloINI~n w 

> O|}m mim] m| co] o lm 
ro) o|- NIWlLo;o]o O10 
fo} m -]=lwlrn}olo Ola 


| 


SQRA _| Square and accumulate 
Square and subtract previous product 


5A00 


b 


Store status register STO 


s 
ST 


SSXM 
Set test/control flag 


STXM Set serial port transmit mode 


SUB <dma>[{,<shift> 


<ind> {,<shift> [,<next ARP>]] 


Subtract from accumulator 1/1 
with shift 


4F00 


<dma> 
<ind>[,<next ARP>] 


dma>; <ind> (,<next ARP>] 
dma>; <ind> [,<next ARP>] 
<constant> 


SUBB Subtract from accumulator 1 
with borrow 


SUBC Conditional subtract 


4700 
4400 
CDOO 


SUBH Subtract from high accumulator 


10 


SUBK Subtract from accumulator 1/1 
short immediate 

<dma> 4500 

<ind>[,<next ARP>] 


<dma> 
<ind> [,<next ARP>] 


Shift accumulator right 


4600 


SUBS Subtract from low accumulator 1 
with no sign extension 


1/ 

/ 

SUBT Subtract from accumulator with 1/ 
shift specified by T register 

/ 

i 2/ 


F 
R 
M 
M 
F 


1 
1 
SX Set external flag 
TBLR Table read 
TBLW 
1 
1 


<dma>; <ind> [,<next ARP>] 5800 


dma>; <ind> [,<next ARP>] 5900 


pay 


Store high P register 


TRAP 
XOR Exclusive-OR with accumulator 


XORK Exclusive-OR immediate with 
accumulator with shift 


Oo 


dma>; <ind> (,<next ARP>] 


2 <constant> [,<shift>] DOO6 15 


ZAC 

ZALH Zero low accumulator and load 1/1 
high accumulator 

ZALR Zero low accumulator and load 1/ 
high accumulator with rounding 

ZALS Zero accumulator, load low 1/ 
accumulator with no sign extension 


TCycles using full-speed, on-chip, external program memory. 


<dma> 4000 


<ind>[,<next ARP>] 
<dma> 
<ind>([,<next ARP>]} 
<dma> 
<ind>[,<next ARP>] 


7B00 


Instruction Set Summary (Continued 
Instr Cyct/Wd Operand Options 
ADLK Add to accumulator long 2 <constant>[,<shift>] D002 
immediate with shift 
ADRK Add to auxiliary register 1 <constant> 7E00 10 
short immediate 
AND AND with accumulator <dma>; <ind>[,<next ARP>] 


ANDK AND immediate with accumulator 2 <constant>[,<shift>] D004 15 
with shift 


APAC Add P register to accumulator 1/1 
Branch unconditionally <pma>[, <ind > [,<next ARP>]] FF80 3 


BACC Branch to address specified 3/1 CE25 
by accumulator 


BANZ Branch on auxiliary register not 0 | 3/2 | 


Format 
15 


a 


raanz | <pma>[,<ind>[{,<next ARP>]} FB80 
BBNZ__| Branch if TC bit # 0 | 3/2__| | <pma>[,<ind>[,<next ARP>I] Ease 
rpaz | <pma> [,< ind >[,<next ARP>]] F880 
fac [Branch on cany 8/2 | | <pma>[.<ind>L next ARP>}] | 5E80 


BGEZ Branch if accumulator > O 
Branch if accumulator > 0 <pma>[,<ind>[,<next ARP>]] F180 


<pma>[,<ind>[,<next ARP]] FA80 
Test bit 1/1 <dma>,<bit code> 9000 
<ind>,<bit code>[,<next ARP>] 
TT Test bit specified by T register <dma>; <ind>[,<next ARP>] 5700 


> <pma>([,<ind>[,<next ARP>]] F480 
= .<in 
BLEZ Branch if accumulator < 0 <pma>[,<ind>[,<next ARP>]] F280 3 
in 


BIOZ Branch on I/O status = 0 


B 
B 
B 
B 


°F 


BZ 

Cc 

GZ i ee 

IT 

BLKD Block move from data memory to <dmal >,<dma2> FDOO 
data memory <dmai >,< ind>[,<next ARP>] 

BLKP Block move from program memory to <pma>,<dma> FCOO 
data memory <pma>,<ind> [,<next ARP>] 

Z 
NC 
NV | 3/2 


<pma>[,< ind >[,<next ARP>]] F780 
] 


LACK Load accumulator immediate 1/1 CA00 10 
LACT Load accumulator with shift 1/1 <dma> 4200 
specified by T register <ind> [,<next ARP>] 
LALK Load accumulator long immediate 2 <constant>([, <shift>] 
with shift 
<AR>,<dma> 3000 
<AR>,<ind> [,<next ARP>] 


LAC Load accumulator with shift <dma>[,<shift> 200 
<ind>[,<shift>[{,<next ARP>]] 


Branch if TC bit = 0 


4 

4 
i 
ma>([. 
}BNZ | Branch if accumulator #0 | 3/2 3 
3 
[None SSSCSC*~CS~SCSCS 
3 
[None CRT Tt 
[None Cid 
|CNEP___| Configure block as program memory | _1/1_[|None | CEOS | 
[None —SSSSSCSCS~CSCSCS CT 
| <dma>; <ind>[<nextARP>] __{|_ 5600 _| 
[None CSTE 
7 
PINGING pe eee po ENE | 
cl a 

<ind>,<PA>[,<next ARP>] 

ia | : 
[LACK i 


2/ 
1/ 
2/ 
3/ 
3/ 
3/ 
3/ 
3/ 
3/ 
3/ 
3/ 
3/ 

/ 

/ 
3/ 
3/ 
3/ 
3/ 
3/ 

/ 
1/ 

/ 

/ 

/ 

/ 


2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
1 
1 
1 
1 
2 
1 


LAR Load auxiliary register 


; 
| 
‘ 
| 
i 
| 
i 
[Branch if accumulator<O | 3/2 | | <pma>[,<ind > [,<next ARP>1] 
| 
: 
a 
Z 
| 
| 
| 
| 
ii 
| 
| 
Hi 
| 
| 
i 


tCycles using full-speed, on-chip, external program memory. 


Instruction Set Summary (Continued) 


Instr 


Cyct/Wd 
LARK Load auxiliary register immediate 
LARP Load auxiliary register pointer 


ipti Operand Options 
LD Load data memory page pointer 


cooo | 1 
5588 
<dma>; <ind>[,<next ARP>] 5200 4 


<dma>; <ind>[,<next ARP>} 5300 


Description Format 


P 
pointer immediate 
LPH 
LRLK 
LST 
T 


1 
pooo | 14 
<dma>; <ind>[,<next ARP>] | 5000 | 
<dma>; <ind>[,<next ARP>] 5100 
<dma>; <ind>[,<next ARP>] 3C00 


<dma> 

<ind>[,<next ARP>] 

<dma> 3F00 
<ind>[,<next ARP>} 

<dma> 3E00 
<ind>[,<next ARP>] 

<dma> 5B00 
<ind>[,<next ARP>] 

<pma>,<dma> 

<pma>,<ind>[,<next ARP>] 

<pma>,<dma> 5C00 
<pma>,<ind>[,<next ARP>] 


<dma>; <ind>[,<next ARP>] | 5500 | 


<dma> 3800 
<ind>[,<next ARP>] 


LST1 Load status register ST1 
L Load T register 


LTA Load T register and accumulate 1/1 
previous product 

LTD Load T register, accumulate 1/1 
previous product, move data 

LTP Load T register and store 1 


P register in accumulator 


Load T register and subtract 
previous product 


MAC Multiply and accumulate 
MACD Multiply and accumulate with 4/2 
data move 


MAR Modify auxiliary register 
MPY Multiply (with T register, 1/1 
store product in P register) 
MPYA Multiply and accumulate 1/1 <dma> 3A00 
previous product <ind>[,<next ARP>] 
MPYK__| Multiply immediate | 1/1 [<constant> Cd A000 13 
MPYS Multiply and subtract 1/1 <dma> 3B00 
previous product <ind>[,<next ARP>] 
MPYU ___| Multiply unsigned <dma>; <ind>{,<next ARP>] 3F00 | 4 | 
NEG [None CEB 
1 


[None 8500 
Normalize contents of accumulator CEBO 


OR immediate with accumulator 2/2 <constant> [,<shift> ]} DOO05 
with shift 
E000 


LTS 


4 
4 


0 


je) 
=e) 
=. 
= 
> 
ry 
a 
° 
< 
3 
= 
a 
- 
S 
—_ 
5 
_ 
A 
a 
3 
rt 
aw 
A 
5 
a 
Vv 
A 
3 
2 
x 
ad 
> 
Ps] 
vU 
pas 
b 
1s) 
oO 
°o 


Output data to port 1/1 <dma>,<PA> 
<ind>,<PA>[,<next ARP>] 
Load accumulator with P register 


Pop top of stack to low accumulator 


POPD Pop top of stack to data memory <dma>; <ind>{,<next ARP>] 7A00 


| 
a 
a 
a 
| 
| 
| 
PSHD Push data memory value onto top 1/1 <dma> 
ctstacksomery mivecmte to tL IRIE nent ARP>1 cael 
i 
| 
a 
a 
| 


None 


None 


PUSH Push low accumulator onto stack 
R 


RFSM Reset serial port frame synchronization None CE36 
mode 


RHM Reset hold mode 
ROL Rotate accumulator left 


ROVM 
RPT 


UT 
PAC 
RC 

ET 


RPTK 


tCycles using full-speed, on-chip, external program memory. 


